Job Description – DevOps Engineer
We are seeking a highly skilled
DevOps Engineer
to join our dynamic team. The ideal candidate will have strong expertise in automation, cloud infrastructure (with a focus on AWS and GenAI services), CI / CD, and containerization, along with a deep understanding of security best practices, monitoring, and system optimization. This role requires a balance of technical proficiency, problem-solving, and collaboration skills to ensure smooth deployment, scalability, and reliability of applications and infrastructure.
Key Responsibilities
Design, automate, and manage scalable, secure, and high-availability cloud infrastructure.
Implement Infrastructure as Code (IaC) using tools like Terraform or CloudFormation.
Develop and maintain CI / CD pipelines with Jenkins, GitLab CI, CircleCI, or AWS CodePipeline.
Automate routine tasks using Python and shell scripting.
Monitor and optimize system performance using Prometheus, Grafana, ELK stack, or AWS CloudWatch.
Manage databases (MySQL, PostgreSQL, MongoDB, DynamoDB), including backup, recovery, and performance tuning.
Deploy and manage web applications on production environments with Nginx, Apache, or similar servers.
Ensure cloud, networking, and server security using IAM, VPC, security groups, and firewalls.
Manage source control and team collaboration using Git and branching strategies.
Work with containerization and orchestration technologies (Docker, Kubernetes, ECS).
Implement disaster recovery, backup, and high-availability strategies.
Troubleshoot incidents, perform root cause analysis, and implement preventive measures.
Collaborate with cross-functional teams, ensuring effective communication and documentation.
Required Skills & Experience
1. Automation & Infrastructure as Code (IaC)
Hands-on experience with Terraform, AWS CloudFormation, or similar.
Proficient in automating infrastructure deployment and management tasks.
Knowledge of configuration management tools (Ansible, Chef, Puppet).
2. Monitoring & Logging
Experience with monitoring tools (Prometheus, Grafana, ELK Stack, AWS CloudWatch).
Ability to set up alerts, dashboards, and audit logs for system health and performance.
3. Cloud Platforms (AWS – Must Have GenAI Services Experience)
Strong knowledge of AWS services : EC2, S3, RDS, Lambda, Bedrock, OpenSearch, Knowledgebase, IAM, VPC, CodeDeploy, CodePipeline, SQS, etc.
Familiar with cloud-native architectures and multi-cloud environments (a plus).
4. Scripting & Automation
Python :
Scripting, automation, Boto3 for AWS, Flask / Django familiarity (bonus).
Shell scripting :
Strong skills in bash or similar for deployment and system automation.
5. Database Management
Experience with MySQL, PostgreSQL, MongoDB, DynamoDB.
Backup, recovery, performance tuning, and database security best practices.
6. Web Application Deployment & Server Management
Experience with production deployments, web / application servers (Nginx, Apache).
Knowledge of reverse proxies, SSL / TLS setup, and security hardening.
7. Security & Networking
Cloud security best practices, IAM management, firewalls, and VPC configurations.
Strong understanding of TCP / IP, DNS, HTTP / HTTPS, and load balancer setups.
8. CI / CD & Version Control
Proficient in Git workflows (GitFlow, trunk-based) for multi-team management.
Experience with CI / CD pipelines (Jenkins, GitLab CI, AWS Code Pipeline).
Knowledge of containerization (Docker) and orchestration (Kubernetes, ECS).
9. High Availability & Scaling
Load balancing strategies (AWS ELB, HAProxy), failover planning.
Auto-scaling in cloud platforms and performance optimization.
10. Backup, Recovery & Incident Response
Implementation of disaster recovery, redundancy strategies, and system resilience.
Troubleshooting, root cause analysis, and preventive measures.
11. Collaboration & Project Management
Strong communication and documentation skills.
Ability to collaborate across teams and explain technical concepts to non-technical stakeholders.
Familiarity with Agile methodologies (Scrum, Kanban) and tools (Jira, Trello) is a plus.
Preferred / Optional Skills
Dockerfile and Docker Compose creation for multi-container applications.
Serverless architecture with AWS Lambda, SQS, SNS.
Project management and task prioritization.
Sr Engineer • India