Talent.com
This job offer is not available in your country.
Site Reliability Engineer - Cloud Infrastructure

Site Reliability Engineer - Cloud Infrastructure

PaddleliftGurugram
30+ days ago
Job description

About the Role :

We are looking for an experienced Site Reliability Engineer (SRE) to join our team and help us enhance the reliability, scalability, and performance of our cloud-based infrastructure.

As an SRE, you will work collaboratively with development and operations teams to ensure high availability, operational efficiency, and continuous improvement in our production environment.

This role is ideal for someone who has a deep understanding of DevOps principles, strong automation skills, and experience with cloud platforms and container Responsibilities :

  • Design, implement, and maintain scalable, highly available, and resilient infrastructure on AWS, GCP, or Azure cloud platforms.
  • Manage and automate infrastructure provisioning, scaling, and management using Terraform, Ansible, or similar tools.
  • Implement, monitor, and optimize CI / CD pipelines to ensure seamless and reliable release automation.
  • Write scripts and automation workflows for environment setup, deployment, and configuration using tools like Jenkins, Terraform, and Ansible.
  • Deploy, manage, and optimize containerized applications using Docker and Kubernetes for container orchestration.
  • Set up and manage monitoring, alerting, and logging systems to proactively detect and mitigate issues.

Tools include Prometheus, Grafana, ELK stack, etc.

  • Troubleshoot and resolve issues that arise in production environments, ensuring minimal downtime and optimal performance.
  • Take part in on-call rotations and respond to incidents, collaborating with cross-functional teams to ensure root causes are identified and mitigated.
  • Continuously monitor system performance and optimize resources to improve uptime, latency, and cost efficiency.
  • Review and improve system reliability through performance tuning and proactive capacity planning.
  • Work with software engineers to improve application stability, performance, and scalability in a fast-paced development environment.
  • Create and maintain detailed documentation for systems, processes, and runbooks to ensure knowledge sharing and best practices.
  • Contribute to a culture of continuous improvement by identifying operational inefficiencies and recommending Skills & Experience :
  • 3 to 6 years of experience in Site Reliability Engineering (SRE), DevOps, or related roles.
  • Proficient in cloud platforms : AWS, GCP, or Azure.
  • Strong expertise with automation tools such as Terraform, Ansible, Jenkins, or equivalent.
  • Solid experience with containerization and orchestration tools like Docker and Kubernetes.
  • Proficient in setting up and managing monitoring and alerting systems (e., Prometheus, Grafana, ELK stack).
  • Hands-on experience with CI / CD pipelines and release automation.
  • Strong problem-solving skills, particularly in incident management and troubleshooting under pressure.
  • Familiarity with scripting languages such as Python, Bash, or Go is a plus.
  • Experience with infrastructure as code (IaC) practices and tools
  • ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Gurugram

    Related jobs
    • Promoted
    Site Reliability Engineer - Cloud Infrastructure

    Site Reliability Engineer - Cloud Infrastructure

    StashFinGurugram
    Position Overview : As a Site Reliability Engineer (SRE) within our organization, you'll play a pivotal role in enh...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SentiLinkGurugram, Haryana, India
    By building the future of identity verification in the United States and reinventing the currently clunky, ineffective, and expensive process, we believe strongly that the future will be 10x better...Show moreLast updated: 22 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    trellixINDIA
    Trellix, the trusted CISO ally, is redefining the future of cybersecurity and soulful work.Our comprehensive, GenAI-powered platform helps organizations confronted by todays most advanced threats g...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - IAC Terraform

    Site Reliability Engineer - IAC Terraform

    BuzzhireGurugram
    Job Description : Responsibilities : - Define and enforce SLOs, SLIs, and error budgets across micros...Show moreLast updated: 30+ days ago
    Site Reliability Engineer, Google Cloud Storage

    Site Reliability Engineer, Google Cloud Storage

    Google India Pvt LtdINDIA
    Site Reliability Engineer, Google Cloud Storage.At Google, we have a vision of empowerment and equitable opportunity for all Aboriginal and Torres Strait Islander peoples and commit to building rec...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    NatWest GroupINDIA
    Join us as a Site Reliability Engineer.In this key role, youll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change manage...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    PhonepeINDIA
    PhonePe is Indias leading digital payments company with 50 crore (500 Million) registered users and 3.Million) merchants covering over 99 PERCENT of the postal codes across India.On the back of it...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - AWS Infrastructure

    Site Reliability Engineer - AWS Infrastructure

    CareerNet TechnologiesGurgaon
    Requirements : - Minimum 5 years of experience in an SRE / DevOps position for SaaS based products.Experience in managing mission mission-cr...Show moreLast updated: 30+ days ago
    Principal Site Reliability Engineer, Database Infrastructure

    Principal Site Reliability Engineer, Database Infrastructure

    autodesk india pvt ltdINDIA
    Principal Site Reliability Engineer, Database Infrastructure.We are looking for a Principal Site Reliability Engineer (SRE) who is passionate about cloud infrastructure and proficient in MySQL data...Show moreLast updated: 30+ days ago
    • Promoted
    Nomiso - Site Reliability Engineer

    Nomiso - Site Reliability Engineer

    NOMISO INDIA PRIVATE LIMITEDGurgaon
    Position Overview : We are seeking an SRE to join our high-impact platform engineering team.You will maintain SLAs for r...Show moreLast updated: 2 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Citadel SecuritiesGurugram
    Candidates who have less than 3 years of experience should possess : .Good knowledge of UNIX / Linux command line.Good understanding of the usage of TCP / IP and UDP networking in applications.Basic unde...Show moreLast updated: 30+ days ago
    • Promoted
    Zinnia - Site Reliability Engineer III

    Zinnia - Site Reliability Engineer III

    ZinniaGurgaon
    Who We Are : Zinnia is the leading technology platform for accelerating life and annuities growth.With innovative enter...Show moreLast updated: 16 days ago
    Site Reliability Engineer - Infrastructure Automation

    Site Reliability Engineer - Infrastructure Automation

    SYNECHRONINDIA
    We are seeking a skilled and experienced SRE Engineer to join our team.The ideal candidate will have a strong background in Site Reliability Engineering (SRE), with expertise in automating and mana...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    tcg digital solutions pvt ltdINDIA
    Bachelors or masters degree in Computer Science, Engineering, or related field.Essential Skills (Two top skills).AWS Ecosystem EKS, EC2, DynamoDB, Lambda, etc. The SRE team should include some memb...Show moreLast updated: 30+ days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Newnovation SolutionsGurugram, Haryana, India
    We are seeking a proactive and technically strong Site Reliability Engineer (SRE) to ensure the stability performance and scalability of our Data Engineering Platform. You will work on cutting-edge ...Show moreLast updated: 30+ days ago
    • Promoted
    DevOps Engineer - Cloud Infrastructure

    DevOps Engineer - Cloud Infrastructure

    ConsultBae India Private LimitedGurgaon
    Job Overview : We are seeking a skilled DevOps Engineer to join our team and support the development, deployment, and op...Show moreLast updated: 12 days ago
    • Promoted
    Site Reliability Engineer - Docker

    Site Reliability Engineer - Docker

    questhiringGurugram
    Responsibilities : - You will architect and build for high performance and scale.You will work on continuously improving...Show moreLast updated: 3 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    CventGurugram, Haryana, India
    Site Reliability is about combining development and operations knowledge and skills to help make the organization better. Whether you have a development background and are interested in learning mor...Show moreLast updated: 22 days ago