Talent.com
This job offer is not available in your country.
Cubic Corporation - Principal Site Reliability Engineer

Cubic Corporation - Principal Site Reliability Engineer

Cubic Transportation Systems India Pvt. Ltd.Hyderabad
1 day ago
Job description

Job Details :

The Senior Site Reliability Engineer is a leader within the team, responsible for designing, building, and owning the complex infrastructure and deployment systems that underpin our live environments. This role is both hands-on and strategic, requiring deep technical expertise and strong collaboration skills. You will mentor junior engineers and work closely with development teams to architect and implement systems that are reliable, scalable, and highly automated. Senior SREs are expected to drive the adoption of robust, automated solutions and ensure those solutions are well-documented and understood across engineering.

Core Responsibilities :

Infrastructure Design & Maintenance :

  • Lead the design, build, and maintenance of our core infrastructure using infrastructure-as-code (IaC) tools (e.g., Terraform, CloudFormation).
  • Own the provisioning and lifecycle management of production, staging, and other critical environments.
  • Architect and implement shared infrastructure components (e.g., logging, metrics, service mesh, load balancing).
  • Drive continuous improvements to infrastructure scalability, availability, and performance.
  • Act as a key partner to development teams, providing infrastructure primitives and strategic guidance on deployment needs.

Deployment Systems & CI / CD :

  • Design, own, and enhance our CI / CD pipelines (GitHub Actions, Argo CD) to maximize reliability, velocity, and automation.
  • Establish and enforce best practices across all environments for deployment, rollback, and observability.
  • Partner with developers to architect and streamline the testing and delivery of code to production.
  • Champion the elimination of manual steps in deployment and operations workflows.
  • Reliability, Observability & Tooling :

  • Architect and manage our monitoring, alerting, and logging infrastructure (Kube-Prometheus-Grafana stack).
  • Define, implement, and track SLOs / SLIs for core services, holding service owners accountable.
  • Proactively identify and eliminate single points of failure, performance bottlenecks, and sources of instability.
  • Lead reliability reviews, blameless post-incident analyses, and capacity planning initiatives.
  • Perform basic debugging of Java applications to assist development teams in & Knowledge Sharing :
  • Ensure all systems and processes built or maintained by the SRE team are accompanied by thorough, up-to-date documentation.
  • Mentor other engineers and contribute to shared knowledge bases, runbooks, and developer-facing materials.
  • Lead internal training sessions, walkthroughs, and pairings to cross-train teammates and reduce knowledge silos.
  • Collaboration & Culture :

  • Work closely with the SRE Lead to define team strategy, prioritize work, and execute on team goals.
  • Mentor junior team members and act as a technical leader across engineering.
  • Participate in on-call rotations, acting as an escalation point for complex issues.
  • Champion a culture of blameless learning, transparency, and continuous improvement.
  • Qualifications & Skills :

  • Experience : 7+ years in a senior SRE, DevOps, or related infrastructure role.
  • Cloud : Deep, hands-on expertise with AWS, including services like ECS, EKS, Aurora (Postgres), EC2, S3, and VPC.
  • Containers & Orchestration : Strong, production-level proficiency with Kubernetes and Helm. Deep understanding of container runtimes and networking.
  • CI / CD : Extensive experience designing, building, and managing complex CI / CD pipelines using tools like GitHub Actions and Argo CD. Experience with container registries like GHCR.
  • IaC : Expertise in Infrastructure as Code, with strong proficiency in Terraform or CloudFormation.
  • Observability : Proven experience with observability stacks, particularly the Kube-Prometheus-Grafana stack, including custom metric instrumentation and advanced dashboarding.
  • Debugging : Ability to perform basic performance analysis and debugging of applications (Java experience is a strong plus).
  • Leadership : Demonstrated ability to mentor junior engineers, lead technical projects, and drive architectural decisions.
  • Incident Management : Experience leading incident response, conducting blameless post-mortems, and driving resulting action items to completion.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad

    Related jobs
    • Promoted
    Engineer, Site Reliability [T500-20520]

    Engineer, Site Reliability [T500-20520]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    HuntingCube Recruitment SolutionsHyderabad, Telangana, India
    Job opening for Lead, Tech (Site Reliability Engineering) – Systems Strict Eligibility Criteria – Please Read Before Applying This role is with a leading global High-Frequency Trading (HFT) firm ...Show moreLast updated: 8 days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Insight GlobalHyderabad, Telangana, India
    Must be able to join within 30 days or less!.An employer is looking for an SRE to join their enterprise level SRE team.They are building a specialized team of Senior Site Reliability Engineers to a...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer, Site Reliability [T500-20519]

    Engineer, Site Reliability [T500-20519]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20503]

    Engineer, Site Reliability [T500-20503]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20504]

    Engineer, Site Reliability [T500-20504]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20502]

    Engineer, Site Reliability [T500-20502]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ConcordHyderabad, IN
    Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 20 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20463]

    Sr Engineer, Site Reliability [T500-20463]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftHyderabad, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 2 days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    Cubic Transportation SystemsHyderabad, Telangana, India
    Hiring Principal Site Reliability Engineer.Site Reliability Engineer (SRE).You will blend software engineering and systems operations to automate processes, monitor performance, lead incident respo...Show moreLast updated: 28 days ago
    • Promoted
    Principal Engineer, Site Reliability [T500-20295]

    Principal Engineer, Site Reliability [T500-20295]

    ANSRHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20518]

    Engineer, Site Reliability [T500-20518]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20517]

    Engineer, Site Reliability [T500-20517]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Engineer, Site Reliability [T500-20515]

    Engineer, Site Reliability [T500-20515]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    GSPANN Technologies, IncHyderabad, Telangana, India
    GSPANN is a global IT services and consultancy provider headquartered in Milpitas, California (U.With five global delivery centers across the globe, GSPANN provides digital solutions that support t...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer, Site Reliability [T500-20521]

    Engineer, Site Reliability [T500-20521]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
    • Promoted
    Principal Engineer, Site Reliability - Accounting Technology [T500-20232]

    Principal Engineer, Site Reliability - Accounting Technology [T500-20232]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 16 days ago