At Nebula Tech Solutions , weโre expanding our global reliability engineering team to support mission-critical systems for our US-based enterprise clients during night shifts only .
Weโre looking for experienced DevOps / SRE professionals (5+ years) who bring hands-on depth in Kubernetes, monitoring / metrics, and coding โ not just infrastructure management.
This is a role for engineers who thrive on troubleshooting, automation, and continuous improvement in high-availability environments. ๐๐
๐ง What Youโll Do
โ Build, optimize, and maintain Kubernetes clusters (EKS / GKE / AKS) for scalability and resilience
โ Design and improve CI / CD pipelines (Jenkins, ArgoCD, FluxCD, Harness, GitHub Actions)
โ Implement and extend observability using Prometheus, Grafana, OpenTelemetry, and custom metrics
โ Develop and maintain internal tools and automations using Python, Go, or similar programming languages
โ Drive incident response, RCA, and reliability improvements across services
โ Collaborate with global teams to ensure continuous uptime and performance
๐งฉ What Weโre Looking For
๐น 5+ years of DevOps / SRE / Platform Engineering experience
๐น Deep, hands-on knowledge of Kubernetes architecture, deployments, debugging, and scaling
๐น Strong programming or scripting skills in Python, Go, Java, or Node.js (beyond shell scripting)
๐น Proven experience with monitoring and telemetry systems (Prometheus, Grafana, ELK, OpenTelemetry)
๐น Understanding of web services, REST APIs, and distributed systems troubleshooting
๐น Familiarity with Terraform, Helm, and GitOps workflows (FluxCD / ArgoCD)
๐ Bonus Points
๐ Location : Remote (India)
๐ Shift : US Night Shift (Continuous)
๐ Client : US-based Enterprise (Global Scale)
If you love solving complex reliability challenges, enjoy scripting and building automation, and want to work with globally distributed systems โ weโd love to hear from you. ๐
Sr Engineer โข palakkad, kerala, in