Talent.com
This job offer is not available in your country.
Principal Engineer, Site Reliability

Principal Engineer, Site Reliability

TMUS Global SolutionsHyderabad, India
24 days ago
Job description

About the Role

The Principal Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms. This role is focused on leading the operational health of these platforms, ensuring the delivery of highly reliable financial applications and data services that meet the demanding requirements of accuracy, compliance, and availability to support business operations.

As a Principal SRE, you will build automation, implement monitoring, improve incident response, and champion DevOps practices that enable Finance and Accounting systems to operate with consistency and trustworthiness, while also coaching and mentoring junior SREs to ensure overall operational excellence.

What Youll Do

Operational Oversight : Own day-to-day operations for Accounting and Finance applications and data platforms, ensuring they run smoothly and meet business expectations.

Reliability & Availability : Ensure Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.

Automation & Efficiency : Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.

Observability & Monitoring : Implement and maintain comprehensive monitoring, alerting, and logging for accounting applications and data pipelines (e.g., Snowflake, dbt workflows, ERP integrations).

Incident Response : Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.

Operational Excellence : Establish and enforce best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.

Collaboration with Engineering & Finance : Partner with software engineers, data engineers, and Finance / Accounting teams to ensure operational needs are met from development through production.

Team Coordination : Manage workload, priorities, and escalations for operations staff and partner teams, ensuring alignment with SLAs and compliance requirements.

Security & Compliance : Ensure financial applications and data pipelines meet audit, compliance, and security requirements.

Continuous Improvement : Drive post-incident reviews, implement lessons learned, and proactively identify opportunities to improve system resilience.

Audit & Compliance Support : Ensure operational practices meet internal controls, audit requirements, and financial compliance standards.

What Youll Bring

Bachelors in Computer Science, Engineering, Information Technology, or related field (or equivalent experience).

7-12 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.

Strong experience with monitoring / observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).

Hands-on expertise with CI / CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).

Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.

Strong scripting / programming experience (Python, Bash, Go, or similar) for automation and tooling.

Proven ability to manage incident response and conduct blameless postmortems.

Experience ensuring compliance, security, and audit-readiness in enterprise applications.

Must Have Skills

SRE

SQL

Snowflake OR Databricks

DevOps OR CICD OR Github Actions

monitoring / observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent)

Automation

Nice To Have

Experience supporting financial applications (ERP, revenue recognition systems, accounting platforms).

Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.

Familiarity with containers and orchestration (Docker, Kubernetes).

Experience building resilience into data pipelines and ensuring auditability for accounting data.

Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.

Create a job alert for this search

Site Reliability Engineer • Hyderabad, India

Related jobs
  • Promoted
Engineer, Site Reliability [T500-20520]

Engineer, Site Reliability [T500-20520]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

HuntingCube Recruitment SolutionsHyderabad, Telangana, India
Job opening for Lead, Tech (Site Reliability Engineering) – Systems Strict Eligibility Criteria – Please Read Before Applying This role is with a leading global High-Frequency Trading (HFT) firm ...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20266]

Engineer, Site Reliability [T500-20266]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 21 days ago
  • Promoted
Engineer, Site Reliability [T500-20519]

Engineer, Site Reliability [T500-20519]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20503]

Engineer, Site Reliability [T500-20503]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
  • New!
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Sapaadsecunderabad, telangana, in
Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering.F&B businesses across 40+ countries. Driven by a passionate team of developers, designers, a...Show moreLast updated: 18 hours ago
  • Promoted
Sr Engineer, Site Reliability Engineer [T500-20464]

Sr Engineer, Site Reliability Engineer [T500-20464]

ANSRhyderabad, telangana, in
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 14 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

TalentiserHyderabad, Telangana, India
Hiring hybrid Site Reliability Engineers for a.Our SaaS platform is designed for high performance, reliability, and automation at scale. You’ll apply engineering principles to operational challenges...Show moreLast updated: 1 day ago
  • Promoted
Engineer, Site Reliability [T500-20504]

Engineer, Site Reliability [T500-20504]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20502]

Engineer, Site Reliability [T500-20502]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ValueMomentumHyderabad, Telangana, India
Site Reliability / Azure DevOps Engineer with Dynatrace Experience.CI / CD practices, infrastructure automation, and cloud operations. The ideal candidate will have deep expertise in Azure DevOps, Inf...Show moreLast updated: 6 days ago
  • Promoted
Sr Engineer, Site Reliability [T500-20279]

Sr Engineer, Site Reliability [T500-20279]

ANSRHyderabad, Hyderabad (district), India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 4 days ago
  • Promoted
Principal Engineer, Site Reliability [T500-20295]

Principal Engineer, Site Reliability [T500-20295]

ANSRHyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20518]

Engineer, Site Reliability [T500-20518]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20517]

Engineer, Site Reliability [T500-20517]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Engineer, Site Reliability [T500-20515]

Engineer, Site Reliability [T500-20515]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
AWS Site Reliability Engineer

AWS Site Reliability Engineer

HTC Global ServicesHyderabad, Telangana, India
Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show moreLast updated: 2 days ago
  • Promoted
Principal Engineer, Site Reliability - Accounting Technology [T500-20232]

Principal Engineer, Site Reliability - Accounting Technology [T500-20232]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 20 days ago
  • Promoted
Engineer, Site Reliability [T500-20521]

Engineer, Site Reliability [T500-20521]

ANSRHyderabad, Telangana, India
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 13 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

GSPANN Technologies, IncHyderabad, Telangana, India
GSPANN is a global IT services and consultancy provider headquartered in Milpitas, California (U.With five global delivery centers across the globe, GSPANN provides digital solutions that support t...Show moreLast updated: 30+ days ago