This job offer is not available in your country.

Senior DevOps Engineer - Site Reliability

DashhirePune

17 days ago

Job description

We are looking for a skilled and experienced Senior DevOps Engineer to lead the design, implementation, and management of our CI / CD infrastructure, cloud operations, and automation frameworks.

You will play a critical role in ensuring the scalability, security, and reliability of our cloud-native applications and systems.

This is a great opportunity to influence infrastructure decisions, improve developer productivity, and contribute to the overall software delivery lifecycle.

Key Responsibilities :

Design, develop, and maintain highly available and scalable infrastructure on cloud platforms such as AWS, Azure, or GCP.
Develop and manage CI / CD pipelines using tools such as Jenkins, GitHub Actions, GitLab CI, or CircleCI.
Automate infrastructure provisioning using Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Pulumi.
Monitor system performance, availability, and security using tools like Prometheus, Grafana, Datadog, ELK Stack, or New Relic.
Implement robust logging, alerting, and monitoring strategies to detect and resolve production issues proactively.
Collaborate closely with development teams to support DevSecOps practices, CI / CD workflows, and ensure smooth software releases.
Manage containerized applications using Docker, Kubernetes, or other orchestration tools.
Configure and manage secrets, credentials, and access control across systems securely.
Perform incident response, root cause analysis, and implement long-term fixes for infrastructure-related problems.
Ensure systems meet compliance, security, and disaster recovery requirements.
Mentor junior engineers and advocate best practices in automation, deployment, and system reliability.

Requirements & Qualifications :

Bachelors or Masters degree in Computer Science, Engineering, or a related technical field.

4-5+ years of experience in DevOps, Site Reliability Engineering (SRE), or related roles.

Proficiency in working with Linux / Unix systems, scripting (e.g., Bash, Python, or Go), and automation.

Hands-on experience with one or more cloud platforms (AWS preferred) and services such as EC2, ECS / EKS, Lambda, S3, IAM, etc.

Experience with Kubernetes for container orchestration and Helm for managing K8s applications.

Deep understanding of networking, firewalls, DNS, load balancers, and cloud security best practices.

Expertise with IaC tools like Terraform, Ansible, or Chef / Puppet.

Familiarity with version control systems like Git and managing repositories in GitHub, GitLab, or Bitbucket.

Knowledge of CI / CD concepts, best practices, and tools for continuous integration and deployment.

Strong troubleshooting and performance tuning skills across the stack.

Excellent communication and collaboration skills

ref : hirist.tech)

Create a job alert for this search

Senior Site Reliability Engineer • Pune