Talent.com
This job offer is not available in your country.
Site Reliability Engineer - DevOps

Site Reliability Engineer - DevOps

hirist.com - IT JobsHyderabad
16 days ago
Job description

Responsibilities :

  • Design and maintain Ansible playbooks and Ansible Tower workflows for disaster recovery and failover automation.
  • Automate failover processes across relational databases (Oracle, MySQL, PostgreSQL, SQL Server).
  • Integrate with tools like Pronghorn for DNS failover and routing logic.
  • Build self-healing scripts and reusable automation patterns for large-scale, asynchronous

systems.

  • Develop a centralized failover dashboard with visual indicators and dependency mapping.
  • Collaborate with DBAs, application owners, and network engineers to ensure seamless
  • failover orchestration.

  • Support Kubernetes-based scaling strategies and CI / CD integration using GitLab.
  • Contribute to operational readiness frameworks including blue-green deployments and observability.
  • Required Skills & Experience :

  • 5+ years in DevOps / SRE roles within enterprise environments.
  • Strong scripting skills in Bash and Python.
  • Expertise in Ansible and Ansible Tower for infrastructure automation.
  • Experience with CI / CD tools like Jenkins and GitLab.
  • Proficiency in Git, version control, and release strategies.
  • Familiarity with Kubernetes, and AWS cloud services.
  • Deep understanding of relational databases and failover strategies.
  • Knowledge of networking, load balancing, and asynchronous messaging systems.
  • Experience with observability tools and monitoring systems.
  • Excellent problem-solving and cross-functional collaboration skills.
  • (ref : hirist.tech)

    Create a job alert for this search

    Site Reliability Engineer • Hyderabad