About T-Mobile :
T-Mobile US, Inc. (NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience.
About TMUS Global Solutions :
TMUS Global Solutions is a world-class technology powerhouse accelerating the company’s global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking.
TMUS India Private Limited operates as TMUS Global Solutions.
About the Role :
We are building automation-first platforms that power scalability, resiliency, and developer efficiency across the digital ecosystem. As a Senior Engineer – Platform Automation, you will be a key member of the CFL Platform Engineering and Operations team. You will help design and implement automation services, self-service tooling, and event-driven workflows that reduce operational toil and improve engineering velocity.
This is a hands-on engineering role focused on infrastructure and platform automation. You’ll collaborate closely with DevOps, SRE, cloud, and security teams to deliver reusable automation frameworks that integrate deeply with CI / CD pipelines, observability systems, and infrastructure-as-code practices.
What You’ll Do :
- Build and maintain automation pipelines and internal tooling for provisioning, orchestration, patching, and compliance
- Develop event-driven automation workflows using CI / CD and orchestration platforms (e.g., GitLab, Jenkins, Argo Workflows)
- Create reusable scripts and templates enabling self-service operations for platform users
- Author and maintain IaC modules using Terraform, Helm, or Ansible
- Promote GitOps adoption to ensure change traceability, testability, and reliability
- Automate Kubernetes resource provisioning and deployment configurations
- Integrate telemetry and alerting to support traceability and self-healing systems
- Build automated incident remediation, chaos testing, and recovery workflows
- Ensure automation solutions meet SLAs on availability, performance, and cost
- Collaborate with cross-functional teams to deliver integrated platform automation
- Participate in platform planning and retrospectives, and evangelize automation tooling
What You’ll Bring :
- Bachelor’s degree in Computer Science, Engineering, or a related technical field
- 4-7 years of experience in infrastructure, DevOps, or platform automation roles
- Proficiency in Python, Go, or Bash for scripting and automation
- Strong experience building CI / CD pipelines with GitLab, Jenkins, etc.
- Deep hands-on expertise with Infrastructure-as-Code tools like Terraform, Helm, Ansible
- Solid understanding of Kubernetes, container orchestration, and deployment strategies
- Familiarity with asynchronous or event-driven automation frameworks
Must Have Skills :
- Application & Microservice : Java, Spring Boot, API & Service Design
- Any CI / CD Tools : GitLab Pipeline / Test Automation / GitHub Actions / Jenkins / CircleCI
- App Platform : Docker & Containers (Kubernetes)
- Any Databases : SQL & NoSQL (Cassandra / Oracle / Snowflake / MongoDB)
- Any Messaging : Kafka, RabbitMQ
- Any Observability / Monitoring : Splunk / Grafana / OpenTelemetry / ELK Stack / Datadog / New Relic / Prometheus
- AI / Machine learning : Anomaly detection
Nice To Have :
- Auto-remediation with guardrails
- Detection strategy (precision / recall, FP reduction)
- KPI ownership : alert noise, MTTR, MTTK
- Partner with SRE / DevOps
- Telemetry unification across AWS / Azure / Databricks