Job title : DevOps Engineer 2 / 3
Location : Hyderabad / Bangalore
Experience : 6-10 years
Notice period : 30 - 60 days
DevOps & Cloud Infrastructure Engineer
To help us build robust and scalable systems that improve the customer experience, we’re looking for a DevOps engineer who will be responsible for developing and provisioning infrastructure and observability platform tools such as Prometheus and Grafana, along with distributed logging and tracing stacks. The ideal candidate will be familiar with shell scripting or Python, and will work with developers and engineers to ensure that infrastructure and observability practices and processes work as intended.
Objectives of this role
- Build and implement new DevOps tools and Terraform modules
- Automate and improve the development and release process
- Design and implement security controls at the infrastructure layer
- Automate releases across environments, including the disaster recovery region
Responsibilities
- Deploy updates and fixes, and provide Level 2 technical support
- Build tools to reduce occurrence of errors and improve customer experience
- Develop software to integrate with internal back-end systems
- Design and implement distributed logging and tracing stack
- Develop scripts to automate metrics collection, operational dashboard
- Design procedures for system troubleshooting and maintenance
Required skills and qualifications
- Experience as a DevOps engineer or in a similar software engineering role
- Proficiency with Git version control system
- Good knowledge of Shell Scripting or Python
- Working knowledge of Terraform, databases and SQL
- Working knowledge of Prometheus, Grafana
- Problem-solving attitude and collaborative team spirit
Preferred skills and qualifications
- Bachelor's degree in computer science, engineering, or a relevant field
- Experience in civil engineering or customer experience
- Experience in developing / engineering applications for a large company
- Prometheus, PromQL expressions
- Grafana dashboards, PagerDuty, Jaeger (any)
- OpenTelemetry, OpenTracing (any)
- Elasticsearch, Logstash, Kibana (ELK) stack a big plus
- Micrometer, Loki, Google BigQuery logging (any)
- Automate failover / scale-up / scale-down
- Automate operational and performance testing activities
- Load testing, chaos testing a plus
- k6 / JMeter / Chaos Toolkit / Gremlin
- Hands-on AWS infrastructure-as-code
- One or more of Kubernetes, Helm, Ansible, Terraform
Education : B.Tech / M.Tech