Sr Site Reliability Engineer

Confidential • Noida, India
7 days ago
Job description

We are seeking a Site Reliability Engineer (SRE) with a proven track record of self-regulation and extensive experience in modern DevOps practices. The ideal candidate will possess deep knowledge and hands-on experience with Grafana, GitLab, Terraform and Helm. This role demands a proactive and diligent individual capable of enhancing our infrastructure's reliability, scalability, and efficiency. Experience with programming languages such as Python is a necessity.

Job Responsibilities

  • Design, implement, and manage cloud infrastructure and applications with a focus on high availability, fault tolerance, and auto-scaling using Terraform.
  • Monitor, analyze, and ensure the performance and reliability of our systems with Dynatrace, implementing automated solutions to preemptively resolve potential issues.
  • Use GitLab for continuous integration/continuous deployment (CI/CD) pipelines, enhancing our deployment strategies to ensure seamless and reliable updates to our services.
  • Lead incident response and blameless post-mortems, driving root cause analysis and implementing preventive measures to mitigate future incidents.
  • Work closely with development teams to advocate for reliability and performance best practices, incorporating SRE principles into the software development lifecycle.
  • Develop and maintain documentation for system architecture, processes, and disaster recovery plans.
  • Stay up-to-date with the latest industry trends and technologies, continuously seeking to improve our systems and processes.
  • Mentor junior staff and set them up for success.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field, or equivalent work experience.
  • At least 5 years of experience in a Site Reliability Engineering or DevOps role, with a demonstrated ability to work independently and self-regulate.
  • Strong experience with infrastructure as code (IaC) tools, specifically Terraform.
  • Strong background in maintaining multiple production environments and mission-critical components.
  • Proficient in monitoring and observability tools, with substantial experience in Dynatrace.
  • Extensive knowledge of CI/CD processes and tools, with a focus on GitLab.
  • Proficient with containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Experience in one or more of the following programming languages: Elixir, Golang, or Python.
  • Excellent problem-solving skills, with the ability to analyze complex systems and identify performance bottlenecks and areas for improvement.
  • Strong communication and collaboration skills, capable of working effectively across different teams and departments.

Skills Required

Docker, Terraform, Dynatrace, GitLab, Grafana, Helm, Kubernetes, Python
