Expereince - 6 - 8 Years
Location - Hyderabad
Primary Skills
Cloud Platforms : AWS (preferred), Azure, GCP
Infrastructure as Code (IaC) : Terraform, CloudFormation
Automation & Configuration Management : Ansible, CI / CD pipelines, container orchestration tools
Programming & Scripting : Python, GoLang, Java, Perl
AI / ML & Gen AI Services : AWS Bedrock, SageMaker, NLP, and other AI / ML tools
Cloud Infrastructure Design & Management : High availability, scalability, disaster recovery planning
What You’ll Do
Collaborate with the development team to understand requirements for application infrastructure.
Design, deploy, and manage cloud infrastructure on AWS (or other cloud platforms) to ensure high availability, scalability, and performance .
Use Terraform / CloudFormation to define and maintain Infrastructure as Code (IaC), automating provisioning and deployment processes.
Develop and maintain automation scripts and playbooks using tools like Ansible to streamline configuration, management, and orchestration of resources.
Contribute to building Gen AI competency on AWS, requiring strong expertise in AI / ML services such as Bedrock, SageMaker, NLP, and other advanced AI offerings.
Establish monitoring and alerting systems to proactively identify performance issues.
Conduct load testing to verify scalability and recovery capabilities.
Write and maintain high-quality code to define, automate, and manage infrastructure.
Collaborate closely with engineers, QA analysts, and other stakeholders to address technical issues and optimize infrastructure performance.
Roles & Responsibilities
Infrastructure Management :
Design, implement, and configure networking, storage, and security policies.
Ensure scalability, reliability, and disaster recovery.
Monitor infrastructure performance through logs, metrics, and alerts; troubleshoot issues quickly.
Keep infrastructure updated with security patches, test changes before deployment.
Automation & CI / CD :
Automate testing, deployment, and configuration management using CI / CD pipelines, container orchestration, and configuration management tools.
Maintain and improve the entire product development lifecycle.
Application & Platform Health :
Maintain the stability of applications hosted on the platform.
Conduct root-cause analysis and resolve performance or infrastructure issues.
Collaboration & Innovation :
Work with cross-functional teams to deliver scalable, efficient solutions.
Stay updated with emerging tools, platforms, and practices to ensure continuous improvement.
Requirements
Bachelor’s degree in Computer Science , Engineering, or a related field.
Strong experience with cloud platforms ( AWS, Azure, GCP ) to deploy, monitor, and manage applications.
Hands-on expertise with scripting languages / frameworks ( Python, GoLang, Java, Perl ).
Deep understanding of CI / CD concepts and ability to design / manage pipelines for safe and efficient deployments.
Expertise in AI / ML services (AWS Bedrock, SageMaker, NLP, etc.) to support Gen AI initiatives and competency development.
Knowledge of networking fundamentals ( TCP / IP, DNS, HTTP ) and ability to configure secure, stable connections.
Proficiency with infrastructure automation tools (Terraform, CloudFormation, Ansible).
Strong troubleshooting and analytical skills to investigate logs, error messages, and code flow.
Experience with caching, compression, and other optimization techniques for web services.
Ability to define project goals, timelines, and resource allocation, while identifying and mitigating security threats.
Strong communication and collaboration skills.
Specialist • Hyderabad, Telangana, India