Designing and building infrastructure to support our AWS services.
Creating and utilizing tools to monitor our applications and services in the cloud, including system health indicators, trend identification, and anomaly detection.
Working with development teams to help engineer scalable, reliable, and resilient software running in the cloud.
Analyzing and monitoring performance bottlenecks and key metrics to optimize software and system performance.
Providing analytics and forecasts for cloud capacity, troubleshooting analysis, and uptime.
Qualifications
Bachelor’s degree in CS or ECE.
3+ years of experience in a DevOps Engineer role.
Strong experience with public cloud platforms (AWS, Azure, GCP), including provisioning and managing core services (S3, EC2, RDS, EKS, ECR, EFS, SSM, IAM, etc.), with a focus on cost governance and budget optimization
Proven skills in containerization and orchestration using Docker, Kubernetes (EKS/AKS/GKE), and Helm
Familiarity with monitoring and observability tools such as SigNoz, OpenTelemetry, Prometheus, and Grafana
Adept at designing and maintaining CI/CD pipelines using Jenkins, GitHub Actions, GitLab CI, Bitbucket Pipelines, Nexus/Artifactory, and SonarQube to accelerate and secure releases
Proficient in infrastructure-as-code and GitOps provisioning with technologies like Terraform, OpenTofu, Crossplane, AWS CloudFormation, Pulumi, Ansible, and ArgoCD
Experience with cloud storage solutions and databases: S3, Glacier, PostgreSQL, MySQL, DynamoDB, Snowflake, Redshift
Strong communication skills, translating complex technical and analytical content into clear, actionable insights for stakeholders
Preferred Qualifications
Experience with advanced IaC and GitOps frameworks: OpenTofu, Crossplane, Pulumi, Ansible, and ArgoCD
Exposure to serverless and event-driven workflows (AWS Lambda, Step Functions)
Experience operationalizing AI/ML workloads and intelligent agents (Amazon SageMaker, Amazon Bedrock, canary/blue-green deployments, drift detection)
Background in cost governance and budget management for cloud infrastructure
Familiarity with Linux system administration at scale