Talent.com
This job offer is not available in your country.
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Cubic Transportation SystemsIndia
30+ days ago
Job description

Hiring Principal Site Reliability Engineer

Experience : 12+ Years

Location : Hyderabad

Notice : Immediate to 30 Days

We're seeking an experienced

Site Reliability Engineer (SRE)

to ensure our services are robust, scalable, secure, and maintainable. You will blend software engineering and systems operations to automate processes, monitor performance, lead incident response, and work closely with engineering teams to enhance service availability and reliability, while bringing efficiencies to operational processes. We’re looking for proactive problem-solvers with strong technical and communication skills who can also effectively support and troubleshoot project operational issues.

Key Responsibilities

Design, deploy, and maintain scalable, secure applications and infrastructure in cloud or hybrid environments

Implement and manage robust monitoring, alerting, and observability systems

Automate recurrent operational tasks using scripts (e.g., Python) and Infrastructure-as-Code tools (e.g., Terraform)

Collaborate with engineers to build highly available, reliable, deployable systems, establishing guardrails around SLOs, SLIs, and error budgets

Own incident response by participating in on-call rotations, conducting RCAs, and implementing preventive measures and self-healing solutions

Conduct performance tuning, capacity planning, and efficient disaster recovery design for strong Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO)

Reduce manual toil in security compliance and patching processes through automation

Support project teams in troubleshooting and resolving operational issues across development, testing, and production environments

Provide guidance and operational support during project rollouts and infrastructure changes to ensure reliability and uptime

Collaborate with senior stakeholders, internal and external, to communicate technical concepts, resolve problems, and influence decision-making on technical matters

Work closely with the product team to stay informed about evolving system design, business logic, and transaction flows to ensure reliability and operational readiness across services

Identify and address organization-wide gaps in the SRE domain and develop implementable solutions that contribute to reliability and operational excellence

Required Qualifications

Bachelor’s degree in Computer Science, Engineering, or equivalent

12+ years as an SRE, DevOps, or related role managing large-scale solutions or platforms

Proficient in scripting (PowerShell, Python, Go, Bash) and solid understanding of coding / development principles

Hands-on experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes)

Experienced with monitoring, logging, alerting, and observability tools

Familiar with CI / CD pipelines and infrastructure tooling (e.g., Jenkins, GitLab CI / CD, Argo CD)

Proficiency in Agile methodologies, such as SCRUM

Strong problem-solving and debugging skills, especially in high-pressure, production-critical environments

Strong collaboration and communication skills

Desired Qualifications

Experience with Terraform and other Infrastructure-as-Code tools

SRE-specific certifications from AWS, GCP, or Azure

Experience shaping and scaling SRE practices

Experience mentoring teams and fostering a strong reliability culture across the organization

Create a job alert for this search

Site Reliability Engineer • India

Related jobs
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ExasoftIndia, India
Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 4 days ago
  • Promoted
Senior Site Reliability Engineer- ELK Expert

Senior Site Reliability Engineer- ELK Expert

iVedha Inc.Nagpur, IN
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer - Chaos Management

Site Reliability Engineer - Chaos Management

Xebianagpur, maharashtra, in
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 11 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

BirlasoftIndia
Responsibilities : Be primarily responsible for providing production, operations support and application administration to business and web applications, 3rd party applications and related ecosystem...Show moreLast updated: 28 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Insight GlobalIndia
USD Must be able to join within 30 days or less! Job Description : An employer is looking for an SRE to join their enterprise level SRE team. They are building a specialized team of Senior Site Relia...Show moreLast updated: 30+ days ago
  • Promoted
Staff Site Reliability Engineer (Observability)

Staff Site Reliability Engineer (Observability)

Palo Alto NetworksIndia
At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show moreLast updated: 9 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

UplersNagpur, IN
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 28 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ViewSonicIndia
Job Requirements : Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding o...Show moreLast updated: 21 days ago
  • Promoted
Sr Site Reliability Engineer

Sr Site Reliability Engineer

Media.netIndia
Our proprietary contextual technology is at the forefront of enhancing Programmatic buying, the latest industry standard in ad buying for digital platforms. HQ is based in New York, and the Global H...Show moreLast updated: 3 days ago
  • Promoted
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Rakuten IndiaIndia
Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, automation, backend...Show moreLast updated: 21 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

HaysIndia
Required skills and qualifications Exp- 7-12 Years • Experience : Proven experience in technical support or engineering, preferably in AI / ML / GenAI environments. Technical Proficiency : Expertise in Ge...Show moreLast updated: 29 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Luxoft IndiaIndia
We are looking for an experienced technical developer to work for one of our client from the banking industry.Project goal is to maintain and develop solutions. Design, develop, and improve the digi...Show moreLast updated: 21 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

XebiaIndia, India
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

BayOne Solutionsnagpur, maharashtra, in
Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 3 days ago
  • Promoted
Principal Engineer, Site Reliability [T500-20295]

Principal Engineer, Site Reliability [T500-20295]

ANSRIndia
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 11 days ago
  • Promoted
Engineer, Site Reliability [T500-20504]

Engineer, Site Reliability [T500-20504]

ANSRIndia
ANSR is hiring for one of its clients.About T-Mobile : T-Mobile US, Inc.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its st...Show moreLast updated: 11 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

SynechronIndia
Good-day, We have immediate opportunity for Senior Site Reliability Engineer.Senior Site Reliability Engineer Job Location : Synechron. Notice : Immediate Joiner About Company : At Synechron, we belie...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ConcordIndia, India
Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 21 days ago