Talent.com
This job offer is not available in your country.
Senior Site Reliability Engineer - IAC Terraform

Senior Site Reliability Engineer - IAC Terraform

Options Executive Search Private LimitedHyderabad
30+ days ago
Job description

Job Title : SRE Lead Engineer.

Location : Hyderabad, India.

We are seeking a DevOps / SRE Lead Engineer to architect and scale our client's multi-tenant SaaS platform with AI / ML at the core.

Our client, a fast-growing AI-powered SaaS company in the FinTech space, is looking for a Site Reliability Engineering (SRE) Lead Engineer to join their dynamic team.

This is an opportunity to design and operate large-scale SaaS systems that integrate cutting-edge AI / ML capabilities.

About the Role :

As the SRE Lead Engineer, you will be responsible for architecting, building, and maintaining infrastructure that powers a multi-tenant SaaS platform.

Youll drive reliability, scalability, and security, while supporting AI / ML pipelines in production.

This is a hands-on role with significant ownership, requiring both technical depth and leadership in site reliability practices.

Key Responsibilities :

  • Architect, design, and deploy end-to-end infrastructure for large-scale, microservices-based SaaS platforms.
  • Ensure system reliability, scalability, and security for AI / ML model integrations and data pipelines.
  • Automate environment provisioning and management using Terraform in AWS (EKS-focused).
  • Implement full-stack observability across applications, networks, and operating systems.
  • Lead incident management and participate in 24 / 7 on-call rotation.
  • Optimize SaaS reliability while enabling REST APIs, SSO integrations (Okta / Auth0), and cloud data services (RDS / MySQL, Elasticsearch).
  • Define and maintain backup and disaster recovery for critical workloads.

Required Skills & Experience :

  • 8+ years in SRE / DevOps roles, managing enterprise SaaS applications in production.
  • Minimum 1 year experience with AI / ML infrastructure or model-serving environments.
  • Strong expertise in AWS cloud, particularly EKS, container orchestration, and Kubernetes.
  • Hands-on experience with Infrastructure as Code (Terraform), Docker, and scripting (Python, Bash).
  • Solid Linux OS and networking fundamentals.
  • Experience in monitoring and observability with ELK, CloudWatch, or similar tools.
  • Strong track record with microservices, REST APIs, SSO, and cloud databases.
  • Nice-to-Have Skills :

  • Experience with MLOps and AI / ML pipeline observability.
  • Cost optimization and security hardening in multi-tenant SaaS.
  • Prior exposure to FinTech or enterprise finance solutions.
  • Qualifications :

  • Bachelors degree in Computer Science, Engineering, or related discipline.
  • AWS Certified Solutions Architect (strongly preferred).
  • Experience in early-stage or high-growth startups is an advantage.
  • Why Join?

  • Be at the forefront of AI / ML-powered SaaS innovation in FinTech.
  • Work with a high-energy, entrepreneurial team building next-gen infrastructure.
  • Take ownership of mission-critical reliability challenges.
  • Grow your career in an environment that values impact, adaptability, and innovation.
  • (ref : hirist.tech)

    Create a job alert for this search

    Senior Site Reliability Engineer • Hyderabad

    Related jobs
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    AutoRABIThyderabad, telangana, in
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 15 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    GSPANN Technologies, Inchyderabad, telangana, in
    GSPANN is a global IT services and consultancy provider headquartered in Milpitas, California (U.With five global delivery centers across the globe, GSPANN provides digital solutions that support t...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20279]

    Sr Engineer, Site Reliability [T500-20279]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 7 days ago
    • Promoted
    Engineer, Site Reliability [T500-20518]

    Engineer, Site Reliability [T500-20518]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Xebiasecunderabad, telangana, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 25 days ago
    • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 14 days ago
    • Promoted
    Site Reliability Engineer - AIOps / Observability Services

    Site Reliability Engineer - AIOps / Observability Services

    Intraedge Technologies Ltd.Hyderabad
    L2Observability / AIOps : Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, m...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 7 days ago
    • Promoted
    Engineer, Site Reliability [T500-20504]

    Engineer, Site Reliability [T500-20504]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 7 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20439]

    Sr Engineer, Site Reliability [T500-20439]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20425]

    Sr Engineer, Site Reliability [T500-20425]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2hyderabad, telangana, in
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 7 days ago
    • Promoted
    Engineer, Site Reliability [T500-20519]

    Engineer, Site Reliability [T500-20519]

    ANSRhyderabad, telangana, in
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20463]

    Sr Engineer, Site Reliability [T500-20463]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Engineer, Site Reliability [T500-20521]

    Engineer, Site Reliability [T500-20521]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Engineer, Site Reliability [T500-20515]

    Engineer, Site Reliability [T500-20515]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Engineer, Site Reliability [T500-20517]

    Engineer, Site Reliability [T500-20517]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20437]

    Sr Engineer, Site Reliability [T500-20437]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20444]

    Sr Engineer, Site Reliability [T500-20444]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago
    • Promoted
    Sr Engineer, Site Reliability [T500-20446]

    Sr Engineer, Site Reliability [T500-20446]

    ANSRHyderabad, Telangana, India
    ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 6 days ago