Role Summary :
This position within Platform Reliability Engineering and Management leveraging SRE Principles and Practices based out of Mumbai location. This role is looking for a multi skilled professional with strong Hands on technical leadership in Observability frameworks, people management skills to deliver critical services ensuring Jefferies operates a highly stable, reliable, and resilient front-to-back plant.
Responsibilities :
- You will be part of the team creating the observability standards, tools, and practices for teams.
- You will design, build, and implement our new observability platform focusing on synthetics, metrics, log ingest and tracing.
- You will collaborate with technical teams to understand their needs and help define and implement observability solutions to meet them.
- You will provide technical mentorship to engineering teams helping them to align with best practices and open standards to own their observability.
- You will promote opportunities for enhanced observability and refactoring and identify areas of optimization.
- You will create documentation and standards to define the observability practice and accelerate adoption in the organization.
Soft Skill / Personal :
Strong communicator and collaborator.communicate clearly and effectively with stakeholders from development, ops, support, and infrastructure.Passionate about observability, metrics, and traces.Someone who approaches work with an automation-first mindset.Continuous learner who keeps up with the latest technologies and best practices in observability engineering.Technical Requirements :
plus years of hands-on experience with strong development skills. Should extensive experience building tools and automationUnderstanding of SRE principles.Extensive experience using observability solutions like New Relic, Dynatrace, Data Dog, OTEL / Grafana, or similar.Used open standards such as Open Telemetry (OTEL), Open Metrics, and Open Tracing to achieve observability goals.Have experience designing and implementing monitoring solutions incorporating synthetic testing, metrics, traces, and logging.Familiar with cloud infrastructure and platform services (CIPS) : Microsoft Azure, Amazon Web Services (AWS), and Google Cloud Platform (GCP).Have a strong understanding of distributed systems. proficient in one or more programming languages, such as JavaScript, Python, and C.