Talent.com
This job offer is not available in your country.
Senior DevOps Engineer - Site Reliability

Senior DevOps Engineer - Site Reliability

DashhirePune
17 days ago
Job description

We are looking for a skilled and experienced Senior DevOps Engineer to lead the design, implementation, and management of our CI / CD infrastructure, cloud operations, and automation frameworks.

You will play a critical role in ensuring the scalability, security, and reliability of our cloud-native applications and systems.

This is a great opportunity to influence infrastructure decisions, improve developer productivity, and contribute to the overall software delivery lifecycle.

Key Responsibilities :

  • Design, develop, and maintain highly available and scalable infrastructure on cloud platforms such as AWS, Azure, or GCP.
  • Develop and manage CI / CD pipelines using tools such as Jenkins, GitHub Actions, GitLab CI, or CircleCI.
  • Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Pulumi.
  • Monitor system performance, availability, and security using tools like Prometheus, Grafana, Datadog, ELK Stack, or New Relic.
  • Implement robust logging, alerting, and monitoring strategies to detect and resolve production issues proactively.
  • Collaborate closely with development teams to support DevSecOps practices, CI / CD workflows, and ensure smooth software releases.
  • Manage containerized applications using Docker, Kubernetes, or other orchestration tools.
  • Configure and manage secrets, credentials, and access control across systems securely.
  • Perform incident response, root cause analysis, and implement long-term fixes for infrastructure-related problems.
  • Ensure systems meet compliance, security, and disaster recovery requirements.
  • Mentor junior engineers and advocate best practices in automation, deployment, and system reliability.

Requirements & Qualifications :

  • Bachelors or Masters degree in Computer Science, Engineering, or a related technical field.
  • 4-5+ years of experience in DevOps, Site Reliability Engineering (SRE), or related roles.
  • Proficiency in working with Linux / Unix systems, scripting (e.g., Bash, Python, or Go), and automation.
  • Hands-on experience with one or more cloud platforms (AWS preferred) and services such as EC2, ECS / EKS, Lambda, S3, IAM, etc.
  • Experience with Kubernetes for container orchestration and Helm for managing K8s applications.
  • Deep understanding of networking, firewalls, DNS, load balancers, and cloud security best practices.
  • Expertise with IaC tools like Terraform, Ansible, or Chef / Puppet.
  • Familiarity with version control systems like Git and managing repositories in GitHub, GitLab, or Bitbucket.
  • Knowledge of CI / CD concepts, best practices, and tools for continuous integration and deployment.
  • Strong troubleshooting and performance tuning skills across the stack.
  • Excellent communication and collaboration skills
  • ref : hirist.tech)

    Create a job alert for this search

    Senior Site Reliability Engineer • Pune