Job Summary
We are seeking an experienced AWS DevOps Engineer II with strong technical leadership, hands-on AWS experience, and a DevOps mindset. The ideal candidate will have in-depth knowledge of AWS services, infrastructure automation, and Kubernetes, along with demonstrated success in customer satisfaction and cloud deployment processes. We are looking for an OpenShift/Kubernetes engineer with broad expertise in orchestrating, automating, and supporting cloud-native platforms and application ecosystems. You’ll ensure the reliability, availability, and security of modern software deployments by managing clusters, CI/CD pipelines, monitoring, service discovery, and ITSM workflows.
Responsibilities
- Cloud Application Development: Develop, deploy, and manage cloud applications with a focus on scalability, performance, and security in AWS environments.
- DevOps: Implement CI/CD pipelines using tools such as Git, Jenkins, and Terraform, ensuring smooth, automated deployments and infrastructure management.
- Create and update runbooks, deployment checklists, SOPs, and operational documentation for applications and infrastructure.
- Logging & Monitoring: Implement logging and monitoring using tools like Splunk and New Relic to maintain system health and performance.
- Incident Management: Apply ITIL concepts in incident, change, and problem management to maintain service reliability and operational excellence.
OpenShift/Kubernetes Cluster Operations
- Cluster Management: Administer and tune clusters, managing nodes, pools, scheduling, resource quotas, and platform operators for smooth operations.
- Ingress & Routing: Configure routes and ingress controllers to optimize traffic flow and security.
Troubleshooting & Issue Resolution
- Diagnose issues such as pod evictions, image pull errors, registry access, DNS and networking faults, and persistent storage challenges.
Release Support & Deployment Patterns
- Design and implement blue/green, canary, and rolling deployments for risk-managed releases.
- Utilize feature flags and ensure rollback strategies for business continuity.
Security & Compliance
- Establish namespaces, RBAC, and network policies, and manage secrets to enforce robust security practices and regulatory compliance.
Secrets & Configuration Management
- Administer Vault, KMS, and ConfigMaps for secrets and app configuration. Handle leases, renewals, encryption, and rotation.
- Monitor and audit access logs and approvals, and enforce dual-control procedures.
API & Messaging Operations
- Operate API gateways (NGINX), Kafka, and MQ: configure routing, rate limiting, and back-pressure, and manage consumer lag and DLQs.
- Troubleshoot schemas, contracts, timeouts, and retries, and ensure idempotent processing.
- Manage backward-compatible deployments, versioning, and traffic shaping.