Role : Google Cloud SRE Engineer
We are seeking an exceptional Google Cloud SRE Engineer to join our engineering team.
This role requires a highly skilled professional with deep expertise in Google Cloud Platform (GCP), Kubernetes, Infrastructure as Code, and CI / CD automation.
The ideal candidate thrives in high-pressure production environments, excels at automation, and continuously drives improvements in system reliability, scalability, and operational efficiency.
Title : Google Cloud SRE Engineer.
Location : Remote Work.
Employment Type : Full Time.
No of Openings : 2.
Timings : 24-7 (rotational Shifts).
Key Responsibilities :
- Ensure the reliability, availability, and performance of production systems hosted on GCP.
- Lead incident response and troubleshooting efforts for critical production issues.
- Perform root cause analysis and implement long-term fixes to prevent recurrence.
- Champion monitoring, alerting, and observability practices to enhance system resilience.
Programming & Automation
Develop and maintain automation tools, scripts, and services using Python, Go, and Bash.Identify repetitive operational tasks and convert them into automated workflows.Build scalable, robust solutions to reduce operational toil and improve reliability.Google Cloud Platform (GCP)
Architect, deploy, and optimize production-grade workloads on GCP.Ensure adherence to GCP best practices, cost optimization strategies, and security compliance.Continuously evaluate and adopt emerging GCP services to enhance cloud operations.Kubernetes (GKE)
Manage and optimize large-scale GKE clusters.Implement deployment strategies, resource management, and cluster security.Troubleshoot complex issues in containerized workloads and cluster environments.CI / CD & Infrastructure as Code
Design, implement, and maintain CI / CD pipelines using Jenkins, GitLab CI, or GitHub Actions.Define and manage cloud infrastructure using Terraform, including reusable and modular configurations.Collaborate with developers to ensure seamless integration and automated testing.Required Skills & Experience :
Programming / Scripting : Expert in Python, Go, and Bash with proven automation portfolio.GCP : 2+ years of hands-on GCP experience with deep understanding of its services and architecture.Kubernetes (GKE) : Advanced experience in managing production clusters, deployments, and troubleshooting.CI / CD : Strong expertise with Jenkins, GitLab CI, or GitHub Actions; proven history of building enterprise-grade pipelines.Terraform : Proficiency in Infrastructure as Code with Terraform, including reusable and modular configurations.Incident Response : Demonstrated excellence in handling critical production incidents and performing RCA.Automation-First Mindset : Consistent track record of converting manual tasks into automated workflows.AI Integration : Awareness and experience in applying AI / ML tools in DevOps practices is a strong plus.Preferred Qualifications
GCP Professional Cloud DevOps Engineer or Architect certification.Experience with monitoring / observability tools (Prometheus, Grafana, ELK, Stackdriver).Exposure to service mesh technologies (Istio, Linkerd).Familiarity with security practices such as IAM, workload identity, and secrets management.(ref : hirist.tech)