Talent.com
Service Reliability Engineers
Confidential • Hyderabad / Secunderabad, Telangana, India
28 days ago
Job description

Our Site Reliability Engineers (SREs) play a crucial role in ensuring our systems are reliable, scalable, and efficient. We are looking for an experienced SRE to join our team and help us maintain and improve our infrastructure.

Responsibilities

  • Monitor and Maintain Systems: Ensure the availability, performance, and reliability of our production environment by monitoring system health and responding to incidents.
  • Automation: Develop and implement automation tools to reduce manual intervention and improve system efficiency.
  • Collaboration: Work closely with development teams to design and implement scalable and reliable systems.
  • Performance Tuning: Analyze system metrics to identify performance bottlenecks and optimize system performance.
  • Incident Management: Lead incident response efforts, conduct root cause analysis, and implement preventive measures.
  • Documentation: Create and maintain comprehensive documentation for system architecture, processes, and procedures.
  • Capacity Planning: Conduct capacity planning and ensure systems can handle future growth.

Qualifications

  • Experience: 6+ years of experience in site reliability engineering, operations, or software engineering.
  • Education: Bachelor's degree in Computer Science, Engineering, or a related field.
  • Technical Skills: Proficiency in scripting languages (e.g., Python, Ruby), experience with containerization (Docker, Kubernetes), and familiarity with cloud platforms (AWS, GCP, Azure).
  • System Knowledge: Strong understanding of Linux/Unix systems, networking, and infrastructure components.
  • Problem-Solving: Excellent troubleshooting and problem-solving skills.
  • Communication: Strong communication and collaboration skills to work effectively with cross-functional teams.
  • Certifications: Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus.

Preferred Skills

  • Experience with configuration management tools (e.g., Ansible, Chef, Puppet).
  • Knowledge of CI/CD pipelines and tools (e.g., Jenkins, GitLab CI).
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).

Why Join Us

  • Innovative Environment: Work on cutting-edge technologies and projects.
  • Growth Opportunities: Opportunities for professional development and career advancement.
  • Collaborative Culture: Join a team that values collaboration, diversity, and inclusion.
  • Competitive Benefits: Comprehensive benefits package including health insurance, retirement plans, and more.

Skills Required

Unix, Chef, Prometheus, ELK Stack, Grafana, Jenkins, GCP, Linux, Docker, Ansible, Ruby, Puppet, Azure, Kubernetes, Python, AWS

