About Client : -
Our client is a French multinational information technology (IT) services and consulting company headquartered in Paris, France. Founded in 1967, it has been a leader in business transformation for over 50 years, leveraging technology to address a wide range of business needs, from strategy and design to managing operations.
The company is committed to unleashing human energy through technology for an inclusive and sustainable future, helping organizations accelerate their transition to a digital and sustainable world.
They provide a variety of services, including consulting, technology, professional, and outsourcing services.
Job Details : -
Location : Pan India
Mode Of Work : Hybrid
Notice Period : Immediate Joiners
Experience : 4-6 years
Type Of Hire : Contract to Hire
Job Description : -
We are looking for a highly skilled and motivated Reliability Engineer for the SRE team who can contribute to the organization’s reliability and resiliency objectives.
The candidate must have strong technical expertise and leadership skills, and will play a key role in maintaining reliability goals and driving continuous improvement.
Key Responsibilities :
Define and implement SRE strategies and best practices in alignment with organizational objectives.
Monitor the client's service level agreements (SLAs), service level objectives (SLOs), and service level indicators (SLIs).
Lead initiatives to improve system reliability, availability, scalability and performance.
Collaborate with development and operations teams to ensure reliability and resiliency goals are met.
Implement and improve incident management processes to minimize downtime and ensure timely resolutions.
Review and contribute to the architecture of critical systems, ensuring they meet reliability and performance goals.
Drive observability practices by implementing robust monitoring, logging, and alerting systems.
Skills required :
Proficiency in writing Splunk Queries and Alerts is a must.
Hands-on experience with at least one APM tool (New Relic, AppDynamics, Honeycomb, Datadog) is a must.
Expertise in automation tools and scripting languages (Python or JavaScript/Node.js) is a must.
Proficiency in at least one cloud platform (AWS, GCP, Azure) is a must.
Strong understanding of distributed systems, microservices architecture, and container orchestration tools (e.g. Kubernetes).
Experience with monitoring tools such as Prometheus and Grafana is a must.
Site Reliability Engineer • Coimbatore, IN