SRE

Confidential
Hyderabad / Secunderabad, Telangana
26 days ago
Job description

Roles & Responsibilities :

  • Design and implement systems and processes to improve the reliability, scalability, and performance of applications
  • Automate routine operational tasks, such as deployments, monitoring, and incident response, to improve efficiency and reduce human error
  • Develop and maintain monitoring tools and dashboards to track system health, performance, and availability
  • Respond to and resolve incidents promptly, conducting root cause analysis and implementing preventive measures
  • Provide ongoing maintenance and support for existing systems, ensuring that they are secure, efficient, and reliable
  • Work on integrating various software applications and platforms to ensure seamless operation across the organization
  • Implement and maintain security measures to protect systems from unauthorized access and other threats

What we expect of you

We are all different, yet we all use our unique contributions to serve patients.

Basic Qualifications :

  • Doctorate degree OR
  • Master's degree and 4 to 6 years of Computer Science, IT or related field experience OR
  • Bachelor's degree and 6 to 8 years of Computer Science, IT or related field experience OR
  • Diploma and 10 to 12 years of Computer Science, IT or related field experience

Preferred Qualifications :

Functional Skills :

Must-Have Skills :

  • Working experience with cloud services on AWS (or Azure, GCP) and containerization technologies (Docker, Kubernetes).
  • Strong programming skills in languages such as Python. Working experience with infrastructure as code (IaC) tools (Terraform, CloudFormation).
  • Working experience with monitoring and alerting tools (Prometheus, Grafana, etc.).
  • Working experience with DevOps / MLOps practice and CI / CD pipelines
  • Proficiency in automated testing tools and frameworks (e.g., Selenium, JUnit, pytest), incident management, root cause analysis of production issues, and system quality improvement.

Good-to-Have Skills :

  • Strong understanding of cloud platforms (e.g., AWS, GCP, Azure) and containerization technologies (e.g., Docker, Kubernetes)
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, Splunk)
  • Experience with data processing tools like Hadoop, Spark, or similar
  • Experience with SAP integration technologies

Professional Certifications :

  • AWS Developer certification (preferred)

Soft Skills :

  • Excellent analytical and troubleshooting skills
  • Strong verbal and written communication skills
  • Ability to work effectively with global, virtual teams
  • High degree of initiative and self-motivation
  • Ability to manage multiple priorities successfully
  • Team-oriented, with a focus on achieving team goals
  • Strong presentation and public speaking skills

Skills Required :

Python, Testing Tools, Root Cause Analysis, DevOps, MLOps, Testing Frameworks
