Talent.com
Site Reliability Engineer
Site Reliability EngineerApple • Hyderabad / Secunderabad, Telangana, India
Site Reliability Engineer

Site Reliability Engineer

Apple • Hyderabad / Secunderabad, Telangana, India
30+ days ago
Job description
Summary

Imagine what we could do together. At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. The people here at Apple don't just build products — they craft the kind of wonder that's revolutionized entire industries. It's the diversity of those people and their ideas that encourages the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts.

Apple's B2B team manages critical integrations with Apple's supply chain partners such as manufacturers, logistics providers, banks, resellers and business customers. We are seeking a technically hands on individual with a real passion for programming and automation.

Join our dynamic team as a Software Reliability Engineer (SRE) and dive into innovative work culture fueled by machine learning, anomaly detection and threat detection. Collaborate with a highly motivated team of professionals who push boundaries and delivering exceptional results. This position offers an exciting opportunity to build your career as an SRE in a supportive environment, where continuous learning and professional development are prioritized.

Description

As an SRE at Apple, you will be part of a team who will implement and maintain best-in-class devops practices, work on complex technical challenges related to scalability, reliability and performance of Apple B2B systems. You will be managing the lifecycle of machine learning models in production and non-production environment. You will be responsible for continuously assessing and improving system processes, detecting anomalies, identify the areas of optimization and implementing solutions to enhance system reliability and performance. You should have a passion for programming and a good conceptual understanding of the operating environment - JVM, Operating System, File Systems, Network Protocols. Technical expertise, strong communication skills and teamwork are essential requirements for this role as it involves working with both technical and non-technical groups within Apple and externally with our supply chain partners.

We are looking for a Senior DevOps Engineer who can design, build, and scale a modern DevOps and SRE ecosystem from scratch.

This role requires deep hands-on expertise, strong architectural thinking, and the ability to establish GitOps-driven, cloud-native CI/CD platforms using the latest technologies.

The ideal candidate will act as a foundational engineer and technical leader, defining standards, tooling, automation, and reliability practices across the organization.

Minimum Qualifications

  • At least 5 years of prior demonstrated experience in a Site Reliability Engineering, DevOps(Must), or an Infrastructure-focused role.
  • Designing and Building DevOps platforms end-to-end alongwith SRE/Platform Engineering.
  • Proven experience in building DevOps platforms from scratch.
  • Applied Experience on GitOps-based deployment models (ArgoCD / Flux)
  • Establish Infrastructure as Code (IaC) practices.
  • Build and operate Kubernetes platforms (EKS / AKS / GKE / OpenShift)
  • Experience working in large-scale, distributed systems
  • Strong problem-solving and architectural skills
  • Proficiency in one or more programming languages (eg. Python)
  • Support of internet-facing production services and distributed systems via deployments, onCall and Incident Management. Lead incident response, RCA, and reliability improvements.
  • Proficiency in implementing and coordinating telemetry using monitoring and observability tools like Splunk, Grafana, and Prometheus, or similar.
  • Experience in solving and resolving issues in Kubernetes from both an operating system and application perspective.
  • Building and operating container orchestrating systems like Kubernetes or EKS.
  • Strong programming experience in Java building web, middleware or backend applications.
  • Deep understanding of Oracle or similar relational databases and NoSQL databases such as MongoDB.
  • Firsthand experience in performance tuning of applications and databases.
  • Knowledge of HTTP/S, TCP, DNS, web application load balancing.
  • Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, X509 certificates and PGP.

Preferred Qualifications

  • Strong programming experience in Java for backend, middleware, or web applications
  • Experience with NoSQL databases (MongoDB, Cassandra, DynamoDB, etc.)
  • Deep understanding of relational databases (Oracle, PostgreSQL, MySQL, etc.)
  • Hands-on experience in performance tuning of applications and databases
  • Experience with advanced observability practices:
  • * Distributed tracing
  • * SLO/SLI design
  • * Error budgets
  • Prior experience in large-scale, highly distributed production environments
  • Experience with container orchestration internals (scheduler, CNI, CSI, etc.)
  • Knowledge of middleware platforms such as WebMethods Integration Server or similar.
  • Experience with multi-cloud or hybrid cloud environments
  • Familiarity with service mesh technologies (Istio, Linkerd, etc.)


Skills Required
Performance Tuning, Devops, Site Reliability Engineering
Create a job alert for this search

Site Reliability Engineer • Hyderabad / Secunderabad, Telangana, India

Similar jobs

Staff Site Reliability Engineer

The Hartford Indiahyderabad, telangana, in

The Safe Enablement team, a subset of the AI Platform Team, carries a mission of building site-reliable practices and guardrails into the platforms the AI Platform team builds and the Analytics Com...Show more

 • Promoted

Site Reliability Engineer III [T500-24447]

McDonald's Global Office in Indiahyderabad, India

One of the world’s largest employers with locations in more than 100 countries, McDonald’s Corporation has corporate opportunities in Hyderabad.Our global offices serve as dynamic innovation and op...Show more

 • Promoted

Site Reliability Engineer Iii T500-24447

McDonald's Global Office in IndiaHyderabad, Republic Of India, IN

One of the world’s largest employers with locations in more than 100 countries, McDonald’s Corporation has corporate opportunities in Hyderabad.Our global offices serve as dynamic innovation and op...Show more

 • Promoted

Sr Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India

The Senior Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.This role is focused on ...Show more

 • Promoted

Engineer - Site Relibility - FPT

Talent500 INCHyderabad, India

Engineer - Site Reliability - FPT.As a Site Reliability Engineer, youll play a crucial role in keeping our digital backbone running seamlessly for millions of customers.Your mission: reduce inciden...Show more

 • Promoted

Site Reliability Engineer III

McDonalds in IndiaHyderabad, India

We are seeking an exceptional Senior Data Product Engineering SRE to lead the development and operational excellence of our data products that deliver insights and drive critical business decisions...Show more

 • Promoted

Senior Site Reliability Engineer

The Hartford IndiaHyderabad, Republic Of India, IN

The Safe Enablement team, a subset of the AI Platform Team, carries a mission of building site-reliable practices and guardrails into the platforms the AI Platform team builds and the Analytics Com...Show more

 • Promoted

Lead Site Reliability Engineer

Concentrixhyderabad, telangana, in

As a Lead Site Reliability Engineer, you will own the reliability and availability of our production systems.You will champion SRE principles across engineering teams — defining SLOs, managing erro...Show more

 • Promoted

Site Reliability Engineer III [T500-24284]

McDonald's Global Office in Indiahyderabad, India

One of the world’s largest employers with locations in more than 100 countries, McDonald’s Corporation has corporate opportunities in Hyderabad.Our global offices serve as dynamic innovation and op...Show more

 • Promoted

Principal Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India

The Principal Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms.This role is focused ...Show more

 • Promoted

Site Reliability Engineer

USThyderabad, telangana, in

SRE Operations Avaloq Support:.Job descriptionRole & responsibilities.Provide production support and troubleshooting for the Avaloq Banking Suite platform, ensuring seamless operations and resolvin...Show more

 • Promoted

Site Reliability Engineer

nexoceanhyderabad, telangana, in

Skills Required: Kubernetes, Terraform, Ansible, ARM Templates, AWS, GCP, Azure, Linux, CI/CD, Python, Bash, Prometheus, Grafana, SRE, Site Reliability.Experience Range: 5 - 15 years.The Senior Sit...Show more

 • Promoted

Site Reliability Engineer

The Hartford Indiahyderabad, India

Our client is a leader in property and casualty insurance, employee benefits and mutual funds.One of the largest insurers in the United States with many decades of expertise, this company is widely...Show more

 • Promoted

Site Reliability Engineer (SRE)

BayOne SolutionsHyderabad, Republic Of India, IN

This is an exciting opportunity to work on.Commerce platforms used by millions of customers worldwide.In this role, you will contribute to building and operating.Control Plane and Observability.Thi...Show more

 • Promoted

Lead Site Reliability Engineer

TMUS Global SolutionsHyderabad, Republic Of India, IN

NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more

 • Promoted

Site Reliability Engineer Iii T500-24284

McDonald's Global Office in IndiaHyderabad, Republic Of India, IN

One of the world’s largest employers with locations in more than 100 countries, McDonald’s Corporation has corporate opportunities in Hyderabad.Our global offices serve as dynamic innovation and op...Show more

 • Promoted

Site reliability engineer

AnonymousHyderabad, Andhra Pradesh, India

SRE Operations Avaloq Support:.Job description Role & responsibilities • Provide production support and troubleshooting for the Avaloq Banking Suite platform, ensuring seamless operations and resol...Show more

 • Promoted

Site reliability engineer

NexoceanHyderabad, Andhra Pradesh, India

Skills Required: Kubernetes, Terraform, Ansible, ARM Templates, AWS, GCP, Azure, Linux, CI/CD, Python, Bash, Prometheus, Grafana, SRE, Site Reliability.Experience Range: 5 - 15 years.The Senior Sit...Show more