DevOps Engineer - L3 (Software Engineer II)
Location & Type : Delhi, Full-time
CTC Range (LPA) : 29.00 - 36.25
Role Overview
Experienced DevOps engineer who can own and scale production infrastructure end-to-end - from CI / CD and IaC to observability and incident response. You’ll lead design docs, harden reliability and security, drive cost / perf efficiency.
What You’ll Do
- Architect and maintain CI / CD pipelines (build, test, security scans, deploy, rollback) with quality gates and environment promotions.
- Design and operate container platforms (ECS / EKS or equivalent), service discovery, blue / green & canary strategies, and autoscaling.
- Implement I nfrastructure as Code (Terraform / CDK / CloudFormation), enforce modular, reviewable, and drift-free infra.
- Build observability : metrics / logs / traces, SLOs / SLIs, dashboards, and actionable alerts; reduce MTTR through runbooks and automation.
- Champion platform reliability : capacity planning, HA / DR (multi-AZ), backup / restore testing, change management.
- Own secrets management, IAM least-privilege, network policies, and baseline hardening (CIS where relevant).
- Drive cost optimization (rightsizing, autoscaling policies, savings plans / spot, storage lifecycle) with monthly reporting.
- Establish release / incident processes (postmortems, RCAs) and lead remediation to cut change failure rate.
- Partner with Backend / AI teams to productize models / services (GPU pools, batching, caching layers) and streamline developer workflows.
- lead design reviews, tech spikes, Monitoring and documentation.
Technical Qualifications
3 - 4+ years in DevOps / SRE / Platform roles supporting production systems at scale.Strong with AWS : VPC, IAM, ECS / EKS, ALB / NLB, RDS / Elasticache / Object storage, CloudWatch.Proficient in Terraform (or CDK / CloudFormation), CI / CD (GitHub / GitLab / Jenkins / Argo) including artifacts and environment promotion.Containers & orchestration : Docker , task definitions / helm charts, autoscaling, health checks, readiness / liveness.Observability : Prometheus / Grafana, OpenTelemetry, log pipelines (ELK / CloudWatch / Datadog), alert routing.Networking & security : VPC / Subnets, SGs / NACLs, TLS, DNS, WAF, IAM design , secrets (KMS / Parameter Store / Vault).Scripting / automation in Python / Bash , configuration management (Ansible or equivalent).Proven incident management : on-call practice, runbooks, RCAs, tuning alerts to reduce noise.Nice to Have
Kubernetes (EKS) production experience, service mesh (Istio / Linkerd), GitOps (ArgoCD / Flux).Image and dependency security (Trivy / Grype / Snyk), SBOMs, policy-as-code (OPA / Conftest).Data platform ops (Postgres / Mongo backups, PITR, replicas), streaming (Kafka / Kinesis).Edge / GPU workloads (Triton / TorchServe) and autoscaling for AI inference.About the Company
Griphic is founded by IIT Delhi engineers with a vision to enrich lives through technological innovation. We combine cutting-edge AI with hyper-realistic virtual experiences to solve problems and disrupt industries. Our team includes IIT Delhi engineers, AI / ML experts, VR developers, and 3D specialists. Backed by SKETS Studio (700+ professionals in BIM, architecture, VR, and 3D visualization), we are building the future of immersive web applications.
Follow Griphic to stay updated on upcoming roles and projects.