Platform Engineer / DevOps Engineer
About StackGen :
StackGen is a rapidly growing product company focused on building scalable, high-performing backend systems and next-generation infrastructure solutions. We are passionate about empowering teams through cloud-native technologies, DevOps best practices, and resilient architecture. At StackGen, we value innovation, collaboration, and the drive to solve complex technical challenges with clean, elegant solutions.
Role Overview :
We are seeking a skilled and experienced Platform Engineer / DevOps Engineer to design, build, and maintain scalable infrastructure, implement DevOps and SRE practices, and ensure the reliability and performance of our systems across multiple customer environments. This role combines hands-on infrastructure engineering with automation, reliability engineering, and modern cloud platform :
- Infrastructure Ownership : Design, implement, and manage production-grade infrastructure using Infrastructure as Code (IaC) for multiple customer environments.
- Cloud & Container Management : Architect and operate highly available and scalable systems on AWS using Terraform and Kubernetes.
- SRE Practices : Implement and maintain robust SRE practices including observability, monitoring (e.g., Prometheus, SigNoz), alerting, incident response, and root cause analysis.
- Automation & CI / CD : Build and manage CI / CD pipelines using GitHub Actions, and automate infrastructure provisioning and deployment workflows.
- System Performance & Reliability : Ensure high availability, security, and performance of infrastructure through proactive measures and continual optimization.
- Collaboration : Partner with engineering teams to integrate DevOps and SRE principles across the SDLC.
- Troubleshooting : Lead complex production debugging and troubleshooting efforts to ensure system uptime.
- Documentation : Maintain comprehensive documentation for infrastructure setups, tools, and operational processes.
- On-Call Support : Participate in the on-call rotation for mission-critical infrastructure support.
- Process Improvement : Define and implement best practices around CI / CD, testing, monitoring, and deployment.
- Hands-On Development : Contribute to backend development as needed, and participate in code reviews to ensure technical excellence.
- Stakeholder Engagement : Communicate progress, risks, and technical decisions clearly to leadership and cross-functional teams.
Requirements :
3- 5 years of experience in DevOps roles, including 3+ years in a lead or ownership role.Proven track record of managing and scaling infrastructure across multiple customer environments.Expertise in AWS, Terraform, and Kubernetes in production settings.Strong understanding of SRE principles including observability, SLOs / SLIs, and incident response.Hands-on experience with monitoring & telemetry tools (e.g., Prometheus, Grafana, SigNoz, OpenTelemetry).Proficiency in scripting or programming languages like Python, Go, or Bash for automation tasks.Familiarity with modern deployment workflows using GitHub Actions or similar CI / CD tools.Solid grasp of cloud security, networking, and infrastructure best practices.Strong communication and collaboration skills.Bachelor's or Masters degree in Computer Science, Engineering, or a related field.(ref : hirist.tech)