Talent.com
Site Reliability Engineer - 2
Confidential • Bengaluru / Bangalore
30+ days ago
Job description

As an SRE-2 at MoEngage, you'll be a critical member of our SRE team, responsible for the health and performance of key services and contributing directly to the evolution of our infrastructure at a scale that few engineers get to experience. This is your chance to deepen your technical expertise, take on more ownership, and mentor emerging talent while working on a platform that operates at the cutting edge.

What You'll Do to Keep Our Engines Roaring

  • Be a Reliability Champion: Take ownership of the reliability, performance, and efficiency of critical services.
  • Automate, Automate, Automate: Design, develop, and implement robust automation solutions to eliminate toil, streamline operations, and improve system resilience.
  • Battle Incidents (and Win): Lead troubleshooting efforts for complex production incidents, perform in-depth root cause analysis, and implement sustainable preventative measures.
  • Sculpt Our Infrastructure: Actively contribute to the design, implementation, and optimization of our cloud infrastructure on AWS and GCP, leveraging your expertise in technologies like Kubernetes.
  • Enhance Observability: Implement and refine advanced monitoring, alerting, and logging solutions to gain deep insights into system behavior and predict potential issues.
  • Collaborate for Success: Partner closely with development teams to influence architectural decisions, ensuring reliability, scalability, and security are built in from the start.
  • Strengthen Our Security Posture: Implement and advocate for advanced security practices within our infrastructure and operational workflows.
  • Drive Efficiency: Analyze and optimize cloud infrastructure spend, identifying and implementing cost-saving opportunities.
  • Guide the Next Wave: Mentor and guide SRE-1 engineers, contributing to growth and knowledge sharing within the team.
  • Be Ready for Action: Participate in our on-call rotation, acting as a key point of escalation and resolution for critical issues.

What Makes You the Ideal Candidate

  • 3-5 years of hands-on experience in Site Reliability Engineering, DevOps, or a similar role with a strong focus on production systems.
  • Demonstrated expertise in Python or Go, with a proven track record of automating complex tasks.
  • Strong command of AWS and / or GCP cloud platforms.
  • In-depth experience with containerization and orchestration using Kubernetes (K8s, ArgoCD, Helm / Kustomize).
  • Experience with infrastructure as code tools like Terraform or Ansible is highly valued.
  • Solid understanding of, and hands-on experience with, monitoring and observability stacks (VictoriaMetrics, Prometheus, Grafana, ELK stack, etc.).
  • Deep knowledge of Linux / Unix systems internals and advanced networking concepts.
  • Proven ability to diagnose and resolve complex issues in large-scale distributed systems.
  • A strong understanding of Cloud Security and Information Security principles and best practices.
  • Experience with cloud cost analysis and optimization techniques.
  • Familiarity with CI / CD pipelines and GitOps methodologies.
  • Experience with messaging queues and distributed systems (Celery, Kafka) is a plus.
  • Excellent communication, collaboration, and problem-solving skills.
  • A desire to mentor and lead by example.

Skills Required

Reliability Engineering, DevOps, Python, AWS, Kubernetes
