SRE / DevOps Engineer - AWS
Role : Full-Time
Experience : 10-16 Years
Location : Pan India (Remote)
Notice Period : Immediate to 15 Days
Role Overview
We are seeking a highly experienced Site Reliability Engineer (SRE) / DevOps Engineer (AWS) to drive infrastructure automation, reliability engineering, CI / CD pipeline management, and cloud-native operations across large-scale distributed systems.
The ideal candidate will have deep AWS expertise, a strong DevOps mindset, and hands-on experience scaling mission-critical production systems with high availability and resiliency.
Responsibilities :
- Design, build, and maintain scalable, secure, and resilient cloud infrastructure on AWS.
- Own and optimize CI / CD pipelines for fast, reliable, and secure delivery of applications and services.
- Implement Infrastructure as Code (IaC) using Terraform, CloudFormation, or CDK.
- Drive observability and monitoring strategy using tools like Prometheus, Grafana, ELK / EFK, CloudWatch, Datadog, or New Relic.
- Automate infrastructure provisioning, configuration management, and deployment processes.
- Enhance system reliability, fault tolerance, and disaster recovery strategies.
- Define and monitor SLIs, SLOs, and SLAs, ensuring systems meet business uptime and performance expectations.
- Perform root cause analysis (RCA), capacity planning, and proactive issue resolution.
- Collaborate with development teams to design cloud-native, microservices-based architectures.
- Drive security, compliance, and cost optimization initiatives across AWS workloads.
- Lead incident response, on-call support, and operational readiness for production systems.
- Mentor junior DevOps / SRE engineers and promote DevOps culture.
Skills & Experience :
- 10-16 years of overall experience, with strong expertise in DevOps, SRE, and AWS cloud platforms.
- Expert-level proficiency in AWS services : EC2, ECS / EKS, Lambda, S3, VPC, IAM, CloudFront, RDS / Aurora, DynamoDB, CloudWatch, Route 53, API Gateway.
- Strong experience in CI / CD tools : Jenkins, GitLab CI, GitHub Actions, or AWS CodePipeline.
- Proficiency in Infrastructure as Code (IaC) : Terraform, AWS CloudFormation, or AWS CDK.
- Hands-on with containerization and orchestration : Docker, Kubernetes (EKS preferred), Helm.
- Expertise in monitoring, logging, and observability : ELK / EFK, Prometheus, Grafana, CloudWatch, Splunk, Datadog.
- Strong background in Linux / Unix administration and shell scripting.
- Experience with automation / configuration management : Ansible, Chef, or Puppet.
- Strong knowledge of networking, load balancing, security, and firewalls in cloud environments.
- Hands-on experience in incident response, RCA, performance tuning, and high-availability architecture.
- Programming / scripting experience in Python, Go, or Bash.
- Strong problem-solving, analytical, and troubleshooting skills.
Good to Have :
- Experience in multi-cloud (Azure / GCP) environments.
- Knowledge of Service Mesh (Istio / Linkerd) and API gateways.
- Exposure to FinOps and cloud cost optimization strategies.
- Experience with DevSecOps practices, vulnerability scanning, and compliance automation.
- Certifications : AWS Certified Solutions Architect - Professional, AWS Certified DevOps Engineer - Professional, or equivalent.
(ref : hirist.tech)