Talent.com
Site Reliability Engineer

Site Reliability Engineer

CodeKarmasecunderabad, telangana, in
17 days ago
Job description

Site Reliability Engineer (Multi-Cloud Deployments)

Location : Bangalore / Remote

Experience : 4–10 years

Type : Full-time (6-month probation)

About CodeKarma

CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s workflow.

Our platform runs both as SaaS and as sub-account / on-prem deployments within our customers’ cloud environments.

We’re looking for engineers who can take ownership of these deployments end-to-end — from setup to monitoring, upgrades, and ongoing reliability.

What You’ll Do

You’ll be responsible for managing CodeKarma’s distributed deployments across client environments — ensuring reliability, security, and performance at scale.

  • Deploy and manage CodeKarma clusters across AWS, GCP, and Azure customer sub-accounts.
  • Monitor, upgrade, and maintain Kubernetes clusters and related infrastructure.
  • Implement observability, alerting, and disaster recovery for each deployment.
  • Handle CI / CD automation for platform releases, patches, and version upgrades.
  • Work closely with client engineering teams to adapt deployments to their environments, policies, and security constraints.
  • Diagnose and resolve environment-specific issues across networking, storage, and configuration layers.
  • Build and maintain infrastructure playbooks, Helm charts, and Terraform modules for standardized deployment.

What We’re Looking For

  • Strong experience managing Kubernetes clusters (EKS, GKE, AKS, or on-prem equivalents).
  • Deep understanding of Kubernetes internals, Helm, ingress controllers, networking, and storage classes .
  • Hands-on experience with CI / CD tools (GitHub Actions, ArgoCD, or similar).
  • Familiarity with monitoring and alerting stacks (Prometheus, Grafana, Loki, ELK, etc.).
  • Working knowledge of cloud infrastructure across AWS / GCP / Azure.
  • Ability to work directly with client engineering and DevOps teams , understanding their constraints and helping them integrate CodeKarma.
  • Strong debugging and communication skills — you’ll often be the bridge between CodeKarma and client infrastructure.
  • Why Join Us

  • Manage real, large-scale production environments across multiple enterprises.
  • Work directly with founders and senior engineers to shape how CodeKarma scales across clients.
  • High ownership, fast-moving environment, and exposure to deep-tech systems.
  • How to Apply

    Please share :

  • A short summary of your Kubernetes experience (cluster management, scaling, debugging, etc.).
  • Any automation or deployment tooling you’ve built or maintained.
  • Links to your GitHub / GitLab / blog posts (if available).
  • Create a job alert for this search

    Site Reliability Engineer • secunderabad, telangana, in

    Related jobs
    • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Engineer, Site Reliability [T500-20521]

    Engineer, Site Reliability [T500-20521]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy ServicesHyderabad, Telangana, India
    We are currently seeking a for a position SRE Engineer in Hyderabad.Job ID : 375656 • • • •Apply Here : • • ( TCS iBegin ) • •Job Description : • • Proven experience as a DevOps / SRE Engineer Expertise in...Show moreLast updated: 18 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeHyderabad, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20502]

    Engineer, Site Reliability [T500-20502]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Engineer, Site Reliability [T500-20517]

    Engineer, Site Reliability [T500-20517]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Engineer, Site Reliability [T500-20515]

    Engineer, Site Reliability [T500-20515]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NationsBenefits IndiaHyderabad, Telangana, India
    Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show moreLast updated: 17 days ago
    • Promoted
    Engineer, Site Reliability T500-20519

    Engineer, Site Reliability T500-20519

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 22 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiHyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 6 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    Atyeti IncHyderabad, Telangana, India
    We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team.Bachelor’s degree in computer science, Engineering, or equivalent practical experience.Site Re...Show moreLast updated: 5 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20279]

    Sr Engineer, Site Reliability [T500-20279]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 22 days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 22 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    SID Global SolutionsHyderabad, Telangana, India
    Job Role : Site Reliability Engineer (SRE) – GCP.SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortu...Show moreLast updated: 28 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20437]

    Sr Engineer, Site Reliability [T500-20437]

    TMUS Global SolutionsHyderabad, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 22 days ago
    • Promoted
    Engineer, Site Reliability [T500-20519]

    Engineer, Site Reliability [T500-20519]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago
    • Promoted
    Engineer, Site Reliability [T500-20518]

    Engineer, Site Reliability [T500-20518]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 21 days ago