Job Description :
We are hiring an experienced Individual Contributor (IC) to join our DevOps engineering team supporting enterprise-grade platforms across cloud and hybrid infrastructure.
The role demands deep expertise in cloud provisioning (primarily Azure), CI / CD automation, Kubernetes container orchestration, observability integration, and production reliability operations.
The candidate must have strong hands-on experience across the full DevOps lifecycle, from environment setup and infra-as-code scripting to troubleshooting production issues.
Comfort with scripting, toolchain integration, and infrastructure security is expected. Prior experience in BFSI or regulated environments will be a plus.
Skills & Required Depth
Cloud Infrastructure :
- Independently provisioned core Azure services (VMs, VNets, storage accounts, NSGs, load balancers) using ARM / Bicep templates or CLI. Has configured hybrid connectivity, resource locks, and tagging standards. Familiar with IAM roles, service principals, and Key Vault.
CI / CD (Azure DevOps / Jenkins) :
- Built and maintained pipelines supporting code build, artifact versioning, multi-stage deployment, test execution, and rollback logic. Experience with gated check-ins, pipeline-as-code (YAML), and integration of security scans. Has diagnosed pipeline failures and optimized pipelines.
Container Orchestration (Kubernetes / Docker) :
- Created Dockerfiles and Helm charts for application packaging. Deployed microservices to AKS or self-managed K8s clusters.
- Familiar with resource quotas, pod autoscaling, rolling deployments, readiness / liveness probes, and service mesh basics.
- Able to troubleshoot deployment and ingress issues.
Infrastructure as Code (Terraform / Ansible) :
- Authored modular Terraform configurations with environment-specific variables. Handled remote state, resource dependency graphs, and drift detection. If using Ansible, has implemented idempotent playbooks with custom roles and inventory.
Scripting (Bash / PowerShell / Python) :
- Wrote automation scripts to support log rotation, backup orchestration, cleanup tasks, and validation hooks.
- Able to handle conditional logic, error trapping, logging, and parameterization in at least one scripting language.
- Scripts have been used in CI / CD hooks or bootstrapping.
Monitoring & Alerting :
- Integrated monitoring tools like Prometheus, Grafana, Azure Monitor, or ELK stack.
- Configured scrape jobs, alert rules, dashboard templates, and escalation policies.
- Experience with APM tools (e.g., New Relic, AppInsights) or log parsing for incidents.
Troubleshooting (Linux / Windows) :
- Proactively debugged failures in distributed systems involving containers, networks, services, or virtual machines.
- Familiar with analyzing event logs, journald / syslog, system metrics, and application error traces.
- Has led or supported root cause analysis (RCA).
Version Control (Git) :
- Proficient in Git workflows (feature branches, squash merges, tagging, and rebasing).
- Experienced in managing code repositories, branch protection rules, and resolving conflicts during parallel development.
- Has integrated version control into CI / CD.
On-Call & Incident Response :
- Participated in rotation-based support.
- Has handled priority incidents, performed triage, updated stakeholders, and restored services under pressure.
- Experience with incident runbooks, war room participation, and postmortems.
Preferred Skills
Active Directory & On-Prem Infra :
- Exposure to managing AD domains, DNS zones, DHCP, and file services in a Windows Server environment.
- Has integrated AD with cloud IAM (e.g., Azure AD Connect).
- Basic understanding of group policy objects (GPOs) and organizational units (OUs).
Certifications :
- Possession of certifications such as Microsoft Certified: Azure DevOps Engineer Expert or AWS Certified DevOps Engineer adds value but is not required.
- Demonstrates structured learning and validation of skills.
Observability Design & Dashboards :
- Involved in defining observability KPIs, selecting relevant metrics, configuring long-term retention, and templating dashboards for engineering use.
- Experience correlating logs, metrics, and traces.
Development Background or Build System Knowledge :
- Experience integrating CI / CD pipelines with build systems like Maven, Gradle, MSBuild, or npm.
- Understanding of versioning, dependency resolution, and build caching strategies. Exposure to unit / integration test execution.
Backup / DR Tooling :
- Familiarity with enterprise backup solutions (e.g., Veeam, Commvault) or Azure Site Recovery.
- Has assisted in defining RPO / RTO targets, restoring data snapshots, or participating in DR drills for production systems.
(ref : hirist.tech)