Talent.com
This job offer is not available in your country.
Site Reliability Engineering Specialist

Site Reliability Engineering Specialist

BT Groupbangalore, karnataka, in
30+ days ago
Job description

Platform Stability and Reliability

  • Ensure the platform meets performance, availability, and reliability SLAs.
  • Proactively identify and resolve performance bottlenecks and risks in production environments.
  • Maintain and improve monitoring, logging, and alerting frameworks to detect and prevent incidents.

Incident Management

  • Act as the primary responder for critical incidents, ensuring rapid mitigation and resolution.
  • Conduct post-incident reviews and implement corrective actions to prevent recurrence.
  • Develop and maintain detailed runbooks and playbooks for operational excellence.
  • Automation and Efficiency

  • Build and maintain tools to automate routine tasks, such as deployments, scaling, and failover.
  • Contribute to CI / CD pipeline improvements for faster and more reliable software delivery.
  • Write and maintain Infrastructure as Code (IaC) using tools like Pulumi or Terraform to provision and manage resources.
  • Collaboration and Mentorship

  • Collaborate with SRE, CI / CD, Developer Experience, and Templates teams to improve the platform’s reliability and usability.
  • Mentor junior engineers by sharing knowledge and best practices in SRE and operational excellence.
  • Partner with developers to integrate observability and reliability into their applications.
  • Observability and Metrics

  • Implement and optimize observability tools like Dynatrace, Prometheus, or Grafana for deep visibility into system performance.
  • Define key metrics and dashboards to track the health and reliability of platform components.
  • Continuously analyze operational data to identify and prioritize areas for improvement.
  • Required :

  • 8+ years of experience in site reliability engineering, software engineering, or a related field.
  • Demonstrated expertise in managing and optimizing cloud-based environments, with 3+ years of experience in AWS.
  • Strong programming skills in one or more languages : Python, Java, Node.js, or TypeScript.
  • Hands-on experience with containerization and orchestration technologies (e.g., Kubernetes, Docker).
  • Proficiency in CI / CD practices and tools, such as GitLab, Jenkins, or similar.
  • Familiarity with monitoring, logging, and alerting tools; experience with Dynatrace is a plus.
  • Preferred :

  • Hands-on experience with Kubernetes (K8s) for container orchestration and deployment.
  • Familiarity with monitoring and observability tools like Dynatrace, Prometheus, or similar.
  • Exposure to agile development practices and collaborative environments.
  • Experience working with other cloud platforms (e.g., Azure or Google Cloud) is a plus.
  • Create a job alert for this search

    Site Reliability Engineering • bangalore, karnataka, in

    Related jobs
    • Promoted
    Site Reliability Engineering, Sr Staff

    Site Reliability Engineering, Sr Staff

    ConfidentialBengaluru / Bangalore
    SRE lead with capability to execute SRE lifecycle and automation process.Discover, design, and implement changes to existing IT infrastructure with a focus on improved reliability, performance, and...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer I

    Site Reliability Engineer I

    ConfidentialBengaluru / Bangalore
    Ensuring the reliability of software systems by designing, implementing, and maintaining scalable and reliable infrastructure. Developing automation tools and scripts to streamline operational tasks...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne Solutionsbangalore, karnataka, in
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 6 hours ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    Delta Air LinesBengaluru, Karnataka, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub ServicesBengaluru, Karnataka, India
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tavantbangalore, karnataka, in
    With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 26 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ElgebraBangalore
    Role Overview : We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our c...Show moreLast updated: 4 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Core Minds Tech SOlutionsHosur
    Job Description : - Engage with our product teams to understand requirements, design, and implement resilient and scalable infrastructure solutions&l...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Aqilea (formerly Soltia)Bangalore, Karnataka, India
    Quick Apply
    We are a consulting company with a bunch of technology-interested and happy people!.We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer [T500-20012]

    Lead Site Reliability Engineer [T500-20012]

    ConfidentialBengaluru / Bangalore, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 9 days ago
    • Promoted
    Observability - Engineer Site Reliability [T500-20244]

    Observability - Engineer Site Reliability [T500-20244]

    Albertsons Companies IndiaBengaluru, Karnataka, India
    About Albertsons Companies Inc.As a leading food and drug retailer in the United States, Albertsons Companies, Inc.Our well-known banners across the United States, including Albertsons, Safeway, Vo...Show moreLast updated: 8 days ago
    • Promoted
    Lead Site Reliability Engineer [T500-20012]

    Lead Site Reliability Engineer [T500-20012]

    Delta Air LinesBengaluru, Karnataka, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 26 days ago
    • Promoted
    Site Reliability Engineer - OpenShift

    Site Reliability Engineer - OpenShift

    ConfidentialBengaluru / Bangalore
    Applies software engineering principles to the operations domain.Contributes to a service's codebase, writes automation that aids in the management of a service, and performs operational engineerin...Show moreLast updated: 4 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftBangalore, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 10 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaBengaluru, Karnataka, India
    Performance & Reliability Engineer ( Senior, Lead , Principal & Manager).Location : Pune, Chennai, Bangalore & Gurgaon.Role : Performance & Reliability Engineer. Job Location : Gurgaon, Chennai, Pune, ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineering Specialist

    Site Reliability Engineering Specialist

    BT GroupBengaluru, Karnataka, India
    Platform Stability and Reliability.Ensure the platform meets performance, availability, and reliability SLAs.Proactively identify and resolve performance bottlenecks and risks in production environ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    At ViewSonic Technologies, we’re passionate about building software that solves problems.We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availabilit...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebiahosur, tamil nadu, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    UplersBengaluru, Karnataka, India
    Uplers is hiring for one of the clients.Role Details : Position : SRE (Oracle Cloud Infrastructure) Type : 10-month contract (possible extension) Mode : Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST Pol...Show moreLast updated: 24 days ago