Talent.com
Site Reliability Engineer

Elgebra, Bangalore
2 days ago
Job description

Role Overview :

We are seeking a highly experienced and technically proficient Site Reliability Engineer (SRE) to join our team in support of our client, Qincline. The ideal candidate will have 7 or more years of dedicated experience in Site Reliability Engineering or a closely related discipline. This pivotal role requires a strong focus on ensuring the reliability, scalability, performance, and operational efficiency of large-scale, complex production systems. You'll be instrumental in bridging the gap between development and operations by applying engineering principles to operational challenges.

Key Responsibilities :

Reliability & Performance Engineering :

  • System Reliability : Design, build, and maintain robust, fault-tolerant production systems and infrastructure to meet stringent Service Level Objectives (SLOs).
  • Performance Tuning : Proactively identify and resolve performance bottlenecks across the entire application stack, from infrastructure to application code.
  • Automation : Develop and implement automation for operational tasks, infrastructure provisioning, deployment, and monitoring to eliminate manual toil.
  • Capacity Planning : Collaborate with development teams on capacity planning, forecasting demand, and ensuring the infrastructure can scale efficiently to meet future business needs.

Operations & Incident Management :

  • Monitoring & Alerting : Establish and maintain comprehensive monitoring, logging, and alerting systems to gain deep visibility into system health and performance (e.g., Prometheus, Grafana, or the ELK Stack).
  • Incident Response : Serve as a key responder during critical incidents, performing rapid triage, mitigation, and recovery.
  • Post-Mortems & RCA : Lead detailed Post-Mortem and Root Cause Analysis (RCA) processes for all significant incidents, ensuring that permanent fixes and preventive measures are implemented.
  • On-Call : Participate in a periodic on-call rotation to provide 24/7 support for critical production systems.

Tooling & Infrastructure :

  • CI/CD & DevOps : Enhance and manage CI/CD pipelines to facilitate fast, reliable, and automated software releases.
  • Containerization & Orchestration : Manage and optimize containerized environments using Docker and Kubernetes.
  • Infrastructure as Code (IaC) : Utilize IaC tools (e.g., Terraform, Ansible) to provision and manage infrastructure in a repeatable and documented manner.

Required Skills & Experience :

Core Experience (7+ Years) :

  • Minimum 7 years of hands-on experience in a Site Reliability Engineer, DevOps Engineer, or Production Engineer role supporting high-availability, mission-critical production environments.
  • Deep expertise in establishing and improving system monitoring, logging, alerting, and telemetry practices.
  • Demonstrated experience with formal Incident Management processes and leading thorough Root Cause Analysis (RCA).

Technical Expertise :

  • Cloud Platforms : Extensive, hands-on experience with at least one major cloud provider (e.g., AWS, Azure, or GCP). This includes managing compute, networking, storage, and managed services.
  • Scripting & Programming : Strong proficiency in scripting and programming languages, with mandatory expertise in Python and Shell scripting for automation and tooling.
  • DevOps Tooling : Proven experience with CI/CD pipeline tools (e.g., Jenkins, GitLab CI, Azure DevOps), Git, and artifact repositories.
  • Containerization : Expert-level knowledge of Docker and robust experience with orchestrating large-scale deployments using Kubernetes.
  • Operating Systems : Strong command of Linux/Unix operating systems and networking fundamentals (TCP/IP, DNS, Load Balancing).

Desired Qualifications (Good to Have) :

  • Experience with configuration management tools (e.g., Ansible, Chef, Puppet).
  • Familiarity with service mesh technologies (e.g., Istio, Linkerd).
  • Knowledge of database administration and performance tuning (SQL/NoSQL).
  • Certifications related to SRE, Cloud (e.g., AWS Certified DevOps Engineer), or Kubernetes (CKA, CKAD).

(ref : hirist.tech)
