Role Overview :
We are seeking a seasoned Senior DevOps Engineer and Infrastructure Architect to lead the design, deployment, and governance of our enterprise-grade infrastructure. This individual will serve as a critical enabler of operational excellence, driving automation, scalability, and resiliency across a multi-cloud and hybrid landscape with a strong emphasis on AWS and on-premise environments.
The ideal candidate is both a strategic architect and a hands-on technologist, capable of influencing cross-functional engineering teams, enforcing DevSecOps practices, and enabling robust platforms for application delivery, data infrastructure, and AI / ML systems.
Having Very good BFSID Domain project experience
Key Responsibilities :
Cloud & Infrastructure Architecture :
- Design and implement secure, resilient, and high-performance cloud architectures on AWS, while supporting integration with GCP, Azure, and on-premise infrastructure (e.g., VMware, OpenStack).
- Define hybrid cloud strategies that address security, network segmentation, identity federation, and data governance across environments.
- Develop infrastructure blueprints and reference architectures that align with business and technical requirements.
Infrastructure as Code & Automation :
Champion Infrastructure as Code (IaC) using Terraform, CloudFormation, or Pulumi for scalable and repeatable provisioning.Automate environment creation, configuration management, and orchestration workflows using Ansible, Helm, or equivalents.Establish GitOps-based pipelines for environment consistency, change tracking, and governance.DevOps & Continuous Delivery :
Architect, implement, and manage enterprise-grade CI / CD pipelines using tools like GitHub Actions, GitLab CI, Jenkins, or ArgoCD.Drive DevSecOps adoption by embedding security, compliance checks, and observability into the software delivery lifecycle.Enable release strategies such as blue / green, canary deployments, and feature flagging.Hybrid Infrastructure & On-Premise Integration :
Lead the integration and optimization of on-prem systems with cloud-native services, ensuring seamless connectivity, policy alignment, and resource efficiency.Manage infrastructure for container platforms, virtualized environments, and legacy applications within private datacenters.Enforce standardized disaster recovery (DR), backup, and failover strategies across hybrid deployments.Monitoring, SRE, and Reliability Engineering :
Define and monitor SLAs, SLIs, and SLOs across services; implement proactive alerting and auto-remediation strategies.Operationalize observability using platforms like Prometheus, Grafana, ELK, CloudWatch, and Datadog.Drive incident response, root cause analysis (RCA), and post-mortem processes to ensure continuous improvement.AI / ML Platform Enablement (Preferred) :
Collaborate with data engineering and machine learning teams to provision infrastructure optimized for AI / ML pipelines, GPU workloads, and data lakes / pools.Support orchestration frameworks such as Kubeflow, MLflow, Airflow, and cloud-native ML services (e.g., SageMaker, Vertex AI).Optimize infrastructure for data ingestion, feature engineering, and real-time inference workflows.Required Qualifications :
8+ years of experience in DevOps, Site Reliability Engineering, or Infrastructure Architecture roles.Deep technical proficiency with AWS, coupled with working experience across GCP, Azure, or private cloud stacks.Expert-level skills in Linux systems, containerization (Docker, Kubernetes), and networking / security best practices.Hands-on experience with infrastructure automation, CI / CD tools, and scripting (e.g., Python, Go, Shell).Strong foundation in cloud security (IAM, VPC, KMS, WAF, GuardDuty), encryption, identity, and compliance frameworks.Preferred Qualifications (Added Advantages) :
AWS Certifications (DevOps Engineer Professional, Solutions Architect Professional).Experience managing data platforms, AI / ML pipelines, and high-volume data lake architectures.Familiarity with enterprise ITIL, SRE principles, and compliance mandates (e.g., ISO 27001, SOC2, GDPR).Experience in cost optimization, cloud spend governance, and FinOps best practices.(ref : hirist.tech)