We are looking for a CloudOps Trainee to join our team as a Platform Operations Engineer. This is a role for a detail-oriented professional with 0-2 years of experience who is eager to learn and contribute to the stability and performance of a critical platform infrastructure. You will work closely with various teams to ensure a reliable and efficient production environment.
Key Responsibilities
- Platform Maintenance & Monitoring : You will be responsible for the day-to-day operations and maintenance of the platform, including conducting regular system health checks, capacity planning, and performance tuning.
- Incident Response : Lead the incident response process, which includes troubleshooting, performing root cause analysis, and documenting all incidents. You will also help develop and maintain incident response plans to ensure rapid issue resolution.
- Automation & Scripting : Develop and maintain scripts and automation tools using languages like Python or Bash to streamline platform operations and reduce the likelihood of human error.
- Deployment Support : Collaborate with development and DevOps teams to support the deployment of new features, updates, and fixes with minimal impact on platform performance.
- Security & Compliance : Ensure the platform complies with industry standards and regulations. You will also work with the security team on regular security assessments and audits.
- Documentation & Reporting : Maintain comprehensive documentation of platform configurations and incident reports. You will also provide regular reports on platform performance to management.
Required Qualifications
Experience : 0 to 2 years of experience.Technical Skills :Experience managing cloud-based platforms ( AWS, Azure, GCP ) and working with virtualization technologies ( Docker, Kubernetes ).Strong knowledge of Linux / Unix systems and networking fundamentals.Proficiency in scripting languages such as Python and Bash .Familiarity with CI / CD pipelines and deployment automation.Soft Skills :Strong problem-solving skills and the ability to work under pressure.Excellent communication and teamwork skills.Additional Requirements : The ability to work in rotational shifts and be available for on-call rotation for incident management.Skills Required
Python Programming, Bash Scripting, cloud platform