Talent.com
TMUS Global Solutions
Principal Architect, Systems [T500-24298]
No longer accepting applications
TMUS Global Solutions • Delhi, India
24 days ago
Job description
About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience.

About TMUS Global Solutions: TMUS Global Solutions is a world-class technology powerhouse accelerating the company’s global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking. TMUS India Private Limited operates as TMUS Global Solutions.

JOB SUMMARY: Responsible for rapid and accurate production triage, in-depth incident analysis, and proactive reliability improvements across cloud-native ecosystems. This role exists to ensure that T-Mobile’s most critical systems remain resilient, scalable, and observable at all times. While hands-on engineering is a component of the role, most of the time is spent providing technical leadership: conducting architecture reviews, mentoring engineers, maturing observability and reliability frameworks, performing performance diagnostics, codifying runbooks, driving incident reviews, refining operational standards, and influencing long-term reliability strategies. The SRE plays a key role in shaping reliability posture across business domains by collaborating on architectural direction, risk assessments, capacity planning, and cross-functional resiliency initiatives.

Key Responsibilities:
Lead real-time production triage for highly escalated incidents (app, platform, network, data) and drive mitigation or failover.
Design and evolve end-to-end observability (structured logs, metrics, traces, events, correlation IDs) to cut MTTD and eliminate blind spots.
Perform deep performance engineering (latency breakdown, GC/heap tuning, thread/async analysis, CPU/memory/I/O profiling) and eliminate tail latency.
Analyze incident and alert trends to remove systemic failure modes and reduce repeat occurrences and noisy alert sources.
Provide recommendations on optimizing Kubernetes workloads (resource requests/limits, HPA, pod disruption budgets, affinity/anti-affinity, ingress, service mesh traffic) for resilience and efficiency.
Build automation and self-healing (runbook codification, dependency health probes, pre-flight deployment guards, drift and config integrity checks).
Contribute to post-incident reviews, producing clear causal chains, durable remediation actions, and tracked ownership to closure.
Enhance release and change safety with automated rollback and SLO guardrails.
Drive capacity and scalability planning (forecast saturation, right-size clusters, assess quota limits, model concurrency vs throughput) to prevent resource exhaustion.
Maintain authoritative runbooks, architecture dependency maps, DR playbooks, and reliability scorecards for transparency and onboarding speed.
Partner with development, platform, security, and data teams to embed reliability patterns (idempotency, bulkheads, circuit breakers, backpressure) early in design.
Proactively surface emerging risks (error budget degradation, scaling inflection points, capacity shortfalls, aging certificates) before they become incidents.

Must Have:
Production triage, troubleshooting, and problem-solving skills, with clear incident communication (concise timeline narration, stakeholder updates, executive summaries, remediation advocacy).
Strong production Kubernetes expertise (controllers, scheduling behavior, networking, ingress, service mesh, resource tuning, multi-cluster operations); CKAD or CKA certification preferred.
Proficiency in at least one of Java, Go, or Python for building diagnostic tooling, automation services, performance harnesses, and reliability utilities.
Solid database and SQL capability (query tuning, indexing, execution plan analysis) plus familiarity with at least one NoSQL or caching layer (Dynamo, Mongo).
Deep observability stack usage (Splunk, Prometheus, Grafana, OpenTelemetry, tracing systems, APM tools) and alert noise reduction techniques.
Performance profiling mastery (async-profiler, flame graphs, thread and heap dumps, network and syscall analysis).
Strong Linux/Unix internals knowledge (process scheduling, cgroups, kernel signals, network stack, filesystem and I/O, perf/strace/tcpdump/iostat/sar tooling).
Automation and infrastructure-as-code experience (Ansible, Helm, GitOps pipelines, CI/CD gating, self-heal workflows).
Strong log, metric, and trace correlation skills for root cause isolation across microservices, queues, caches, and external dependencies.
Messaging and event streaming familiarity (Kafka, SQS, RabbitMQ) including lag analysis, consumer scaling, ordering, and replay strategies.
Ownership mindset with collaborative influence, mentoring peers in production debugging, reliability principles, and continuous improvement discipline.

Additional Skills:
Practical SRE framework implementation (SLI taxonomy, SLO lifecycle, error budget policies, toil reduction, reliability scorecards).
Distributed systems resilience patterns (circuit breakers, retries with jitter, timeouts, bulkheading, idempotent semantics, backpressure, graceful degradation).

Nice to have:
Hands-on multi-region AWS and/or Azure experience (load balancing, autoscaling, Route53/DNS/Azure DNS, storage replication, DR and failover orchestration).
Demonstrated proactive risk identification (capacity hotspots, noisy dependencies, cascading failure precursors, config drift, expiring certs/secrets).
