Experience Required : 3-6 years
Location : Gurgaon
Department : Product and Engineering
🔧 Key Responsibilities
- Design, implement, and maintain highly available and scalable infrastructure using AWS Cloud Services.
- Build and manage Kubernetes clusters (EKS, self-managed) to ensure reliable deployment and scaling of microservices.
- Develop Infrastructure-as-Code using Terraform, ensuring modular, reusable, and secure provisioning.
- Containerize applications and optimize Docker images for performance and security.
- Ensure CI / CD pipelines (Jenkins, GitHub Actions, etc.) are optimized for fast and secure deployments.
- Drive SRE principles including monitoring, alerting, SLIs / SLOs, and incident response.
- Set up and manage observability tools (Prometheus, Grafana, ELK, Datadog, etc.).
- Automate routine tasks with scripting languages (Python, Bash, etc.).
- Lead capacity planning, auto-scaling, and cost optimization efforts across cloud infrastructure.
- Collaborate closely with development teams to enable DevSecOps best practices.
- Participate in on-call rotations, handle outages with calm, and conduct postmortems.
🧰 Must-Have Technical Skills
Kubernetes (EKS, Helm, Operators)Docker & Docker ComposeTerraform (modular, state management, remote backends)AWS (EC2, VPC, S3, RDS, IAM, CloudWatch, ECS / EKS)Linux system administrationDatabase tuning based on hardware config.CI / CD pipelines (Jenkins, GitLab CI, GitHub Actions)Logging & monitoring tools : ELK, Prometheus, Grafana, CloudWatchSite Reliability Engineering practicesLoad balancing, autoscaling, and HA architectures💡 Good-To-Have
GCP or Azure exposureSecurity hardening of containers and infrastructureChaos engineering exposureKnowledge of networking (DNS, firewalls, VPNs)👤 Soft Skills
Strong problem-solving attitude; calm under pressureGood documentation and communication skillsOwnership mindset with a drive to automate everythingCollaborative and proactive with cross-functional teams