Key Responsibilities :
SRE & DevOps Strategy :
- Design and develop a robust SRE ecosystem following industry best practices.
- Formulate SRE strategies based on emerging trends and organizational needs.
- Embed best practices in local functional teams to ensure consistent adoption.
Platform & Automation :
- Develop scaffolding libraries for seamless interaction with cloud-native components.
- Prioritize automation to reduce operational toil, improve reliability, and gain efficiency.
- Investigate new technologies for automated onboarding into the cloud ecosystem.
Cloud & Infrastructure Management :
- Administer Kubernetes clusters, including creating operators/controllers for automation.
- Implement Infrastructure as Code (IaC) and Configuration as Code using AWS CDK, Terraform, or Ansible.
- Manage AWS cloud services including Networking, EKS ecosystem, Security, Application Integration, and Observability.
CI/CD & Observability :
- Work with CI/CD pipelines using Docker, AWS CodeBuild, CodePipeline, and GitHub Workflows.
- Implement monitoring and observability using tools like Prometheus, Grafana, Dynatrace, Splunk, and AWS CloudWatch.
- Utilize service mesh tools such as Istio for microservices-based architectures.
Collaboration & Communication :
- Collaborate with product, technology, and operations teams across time zones.
- Maintain excellent written, spoken, and presentation skills in English.
- Provide guidance and documentation for onboarding applications onto the SRE platform.
Qualifications & Experience :
- Strong experience in the Kubernetes ecosystem, including cluster administration and automation.
- Proficiency in programming languages such as TypeScript or Go.
- Deep understanding of AWS cloud services and ecosystem.
- Experience with Infrastructure as Code (AWS CDK, Terraform, Ansible).
- Familiarity with CI/CD concepts, automation tools, monitoring tools, and service mesh is a plus.
Skills Required :
Kubernetes, AWS Cloud, AWS EKS, TypeScript, DevOps