Talent.com
Senior Site Reliability Engineer (Middleware)
Senior Site Reliability Engineer (Middleware)Nextiva • Chennai, Tamil Nadu, India
Senior Site Reliability Engineer (Middleware)

Senior Site Reliability Engineer (Middleware)

Nextiva • Chennai, Tamil Nadu, India
30+ days ago
Job description

Redefine the future of customer experiences. One conversation at a time.

At Nextiva were reimagining how businesses connect bringing together customer experience and team collaboration on a single conversation centric platform. Powered by AI driven by human innovation.

Our culture is forward thinking customer obsessed and built on the belief that meaningful connections drive better business outcomes. Whether its through our signature Amazing Service the technology we create or the experiences we cultivate connection is at the core of who we are.

If youre ready to collaborate with incredible people make an impact and help businesses everywhere deliver truly amazing experiences this is where you belong.

Build Amazing. Deliver Amazing. Live Amazing. Be Amazing.

We are looking for a Senior Site Reliability Engineer (SRE) to join our Middleware Engineering this highly dynamic environment youll be responsible for supporting and scaling our Kafka and Elasticsearch infrastructure - core systems that power our SaaS platform.

Were looking for someone who thrives on automation embraces AI-driven observability and is eager to learn and adopt new technologies quickly. Youll not only respond to production issues but proactively build intelligent resilient systems to prevent them.

If you enjoy owning systems end to end writing clean automation and working in a fast-moving team that values innovation this role is for you.

Key Responsibilities

  • Triage troubleshoot and resolve complex production issues involving Kafka and Elasticsearch
  • Design and build automated monitoring alerting and logging systems - leveraging AI / ML techniques where possible
  • Write tools and infrastructure software to support self-healing auto-scaling and incident prevention
  • Automate system administration tasks - from patching and upgrades to config and deployment workflows
  • Use and manage GitHub extensively for infrastructure-as-code release management and collaboration
  • Partner with development QA and performance teams to ensure middleware systems are production-ready
  • Participate in the on-call rotation and continuously improve incident response and resolution playbooks
  • Mentor junior engineers and contribute to a culture of automation learning and accountability
  • Lead large-scale reliability and observability projects in collaboration with global teams

Qualifications

  • Bachelors degree in Computer Science Engineering or equivalent practical experience
  • Fluent English communication skills (spoken and written)
  • Core Competencies

  • 6 years of experience in software development automation or infrastructure engineering
  • Deep experience with MongoDB Kafka and / or Elasticsearch in production environments
  • Strong Linux systems expertise and 6 years managing Linux-based environments
  • Hands-on experience with cloud platforms - GCP and / or AWS required
  • Proficient in scripting languages like Python Bash etc
  • Automation-first mindset - deep experience with Ansible Terraform Jenkins
  • Expert-level understanding of Git and GitHub workflows for CI / CD and infrastructure-as-code
  • Proficient with container tools (Docker) and orchestrators (Kubernetes)
  • Strong understanding of SRE principles - SLAs / SLOs alerting observability and incident management
  • Experience with SQL caching systems (e.g. Redis) and troubleshooting distributed systems
  • Quick learner with a strong curiosity for new tools frameworks and AI / ML use cases in operations
  • Nice to Have

  • Observability Tools : Datadog Splunk Kibana Opsgenie
  • Programming : Java / Spring JavaScript / React
  • Middleware : RabbitMQ Tomcat
  • Experience with AI / ML-based anomaly detection AIOps platforms and LLM integrations for infrastructure
  • Azure cloud experience (nice to have)
  • Why Join Us Why Join Us

  • Shape the future of middleware reliability using AI and intelligent automation
  • Work with a global team that values initiative innovation and ownership
  • Grow in a fast-paced environment where learning and experimentation are part of the culture
  • Drive technical leadership mentor others and make a meaningful platform-wide impact
  • How to Apply

    If youre passionate about automation AIOps MLOps and scalable middleware infrastructure and youre ready to move fast learn constantly and own critical systems - wed love to connect with you.

    Nextiva DNA (Core Competencies)

    Nextivas most successful team members share common traits and behaviors :

  • Drives Results : Action-oriented with a passion for solving problems. They bring clarity and simplicity to ambiguous situations challenge the status quo and ask what can be done differently. They lead and drive change celebrating success to build more success.
  • Critical Thinker : Understands the why and identifies key drivers learning from the past. They are fact-based and data-driven forward-thinking and see problems a few steps ahead. They provide options recommendations and actions understanding risks and dependencies.
  • Right Attitude : They are team-oriented collaborative competitive and hate losing. They are resilient able to bounce back from setbacks zoom in and out and get in the trenches to help solve important problems. They cultivate a culture of service learning support and respect caring for customers and teams.
  • Total Rewards

    Our Total Rewards offerings are designed to allow our employees to take care of themselves and their families so they can be their best in and out of the office.

    Our compensation packages are tailored to each role and candidates qualifications. We consider a wide range of factors including skills experience training and certifications when determining compensation. We aim to offer competitive salaries or wages that reflect the value you bring to our team. Depending on the position compensation may include base salary and / or hourly wages incentives or bonuses.

  • Medical - Medical insurance coverage is available for employees their spouse and up to two dependent children with a limit of 500000 INR as well as their parents or in-laws for up to 300000 INR. This comprehensive coverage ensures that essential healthcare needs are met for the entire family unit providing peace of mind and security in times of medical necessity.
  • Group Term & Group Personal Accident Insurance - Provides insurance coverage against the risk of death / injury during the policy period sustained due to an accident caused by violent visible & external means.
  • Coverage Type - Employee Only

  • Sum Insured - 3 times of annual CTC with minimum cap of INR
  • Free Cover Limit - 1.5 Crore
  • Work-Life Balance - 15 days of Privilege leaves per calendar year 6 days of Paid Sick leave per calendar year 6 days of Casual leave per calendar year. Paid 26 weeks of Maternity leaves 1 week of Paternity leave a day off on your Birthday and paid holidays
  • Financial Security - Provident Fund & Gratuity
  • Wellness - Employee Assistance Program and comprehensive wellness initiatives
  • Growth - Access to ongoing learning and development opportunities and career advancement
  • At Nextiva were committed to supporting our employees health well-being and professional growth. Join us and build a rewarding career!

    #LI-MK1 #LI-Hybrid

    Founded in 2008 Nextiva has grown into a global leader trusted by over 100000 businesses and 1M users worldwide. Headquartered in Scottsdale Arizona and with teams across the globe were the future of customer experience and team collaboration through our AI-powered conversation-centric platform.

    Want to see what life at Nextiva is all about Connect with us on Instagram Instagram MX YouTube LinkedIn and the Nextiva Blog .

    Required Experience :

    Senior IC

    Key Skills

    Kubernetes,FMEA,Continuous Improvement,Elasticsearch,Go,Root cause Analysis,Maximo,CMMS,Maintenance,Mechanical Engineering,Manufacturing,Troubleshooting

    Employment Type : Full Time

    Experience : years

    Vacancy : 1

    Create a job alert for this search

    Senior Site Reliability Engineer • Chennai, Tamil Nadu, India

    Related jobs
    Site Reliability Engineer / Architect - CI / CD Pipeline

    Site Reliability Engineer / Architect - CI / CD Pipeline

    Cling Multi Solutions • Chennai
    Job Description : Role : Site Reliability Engineer (SRE) Location : Bangalore / Chennai / Pune (Hybrid) Experience : 5+ y...Show more
    Last updated: 30+ days ago • Promoted
    AWS Site Reliability Engineer

    AWS Site Reliability Engineer

    HTC Global Services • Chennai, Tamil Nadu, India
    Troy, Michigan, is a leading global Information Technology solution and BPO provider.HTC assists clients across multiple industry verticals, offering turnkey project lifecycle in, e-business, data ...Show more
    Last updated: 21 days ago • Promoted
    Sr. DevOps Engineer

    Sr. DevOps Engineer

    Olive Trees Consulting • chennai, tamil nadu, in
    Our client is a manufacturing company, headquartered in UK with their India office in Bangalore.This role is for a Senior Developer with significant DevOps experience. Design, implement, and maintai...Show more
    Last updated: 9 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Intellistaff Services Pvt. Ltd • Chennai, Tamil Nadu, India
    SRE Public Cloud & Cloud Engineering.Docker / Kubernetes, Terraform (incl.DevOps & CI / CD (GitHub, Cloud Build).Scripting : Python, Go, PowerShell, Java, JS / Node. Messaging : Kafka, RabbitMQ, ActiveMQ.Mo...Show more
    Last updated: 8 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Datum Technologies Group • Chennai, Tamil Nadu, India
    Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show more
    Last updated: 12 days ago • Promoted
    Freelance Site Reliability Engineer (SRE) / DevOps Engineer

    Freelance Site Reliability Engineer (SRE) / DevOps Engineer

    ThreatXIntel • mount, India
    ThreatXIntel is a startup cyber security company focused on delivering customized, affordable solutions to protect businesses and organizations from cyber threats. Our experienced team specializes i...Show more
    Last updated: 14 hours ago • Promoted • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Grootan Technologies • Chennai, Tamil Nadu, India
    Site Reliability Engineer (SRE).In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications.You will leverage your e...Show more
    Last updated: 11 days ago • Promoted
    Site Reliability Engineer (SRE) / DevOps Engineer

    Site Reliability Engineer (SRE) / DevOps Engineer

    Stoopa AI • Chennai, Tamil Nadu, India
    AI is building next-generation AI-driven platforms for ports and is focused on reliability, speed, and intelligent automation. As we scale our next generation smart port product Turi, we are hiring ...Show more
    Last updated: 4 days ago • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Aim Plus Staffing Solutions • Chennai
    Mandatory skills : We are seeking a highly skilled Site Reliability Engineer (SRE) with strong expertise in Google Cloud Platform (GCP) and CI / CD automation to lead cloud infra...Show more
    Last updated: 19 days ago • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more
    Last updated: 30+ days ago • Promoted
    Senior Site Reliability Engineer (C# / Python)

    Senior Site Reliability Engineer (C# / Python)

    Entech • Chennai, IN
    Senior Software Site Reliability Engineer (C# / Python).You’ll ensure enterprise systems are reliable, scalable, and performant - driving improvements, leading SRE initiatives, and mentoring teams on...Show more
    Last updated: 6 days ago • Promoted
    Senior DevOps & Database Reliability Engineer – 100% Remote

    Senior DevOps & Database Reliability Engineer – 100% Remote

    Hyly.AI • Chennai, IN
    Remote
    AI, we’re building the first AI + Data Fabric for the multifamily industry, transforming how clients manage, secure, and scale their marketing and operational data. As the industry moves toward a co...Show more
    Last updated: 13 days ago • Promoted
    TCS Walkin Drive For Site Reliability Engineering (SRE)

    TCS Walkin Drive For Site Reliability Engineering (SRE)

    Tata Consultancy Services • Chennai, Tamil Nadu, India
    Site Reliability Engineering (SRE)Ops.TCS has been a great pioneer in feeding the fire of young Techies like you.We are a global leader in the technology arena and there’s nothing that can stop us ...Show more
    Last updated: 6 days ago • Promoted
    Athenahealth - Senior Site Reliability Engineer - On-Premises Infrastructure

    Athenahealth - Senior Site Reliability Engineer - On-Premises Infrastructure

    athenaHealth Technology Private Limited. • Chennai
    Description : Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for al...Show more
    Last updated: 30+ days ago • Promoted
    Keuro Life - Senior Site Reliability Engineer - DevOps

    Keuro Life - Senior Site Reliability Engineer - DevOps

    Keuro Life • Chennai
    Site Reliability Engineer / DevOps We are seeking an experienced Site Reliability Engineer / DevOps professional with a minimum of 6 years in the industry.The ideal c...Show more
    Last updated: 30+ days ago • Promoted
    Site Reliability Engineer (SRE) – Infrastructure & Automation

    Site Reliability Engineer (SRE) – Infrastructure & Automation

    InstaService • Chennai, IN
    InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show more
    Last updated: 19 days ago • Promoted
    Miratech - Senior Site Reliability Engineer

    Miratech - Senior Site Reliability Engineer

    Miratech • Chennai
    Description : About Miratech : Miratech helps visionaries change the world.We are a global IT services and consulting company tha...Show more
    Last updated: 15 days ago • Promoted
    Site Reliability Engineer - Elastic Kubernetes Service

    Site Reliability Engineer - Elastic Kubernetes Service

    MNR Solutions • Chennai
    Description : Site Reliability Engineer (SRE) Kubernetes & Cloud Position Summary : We are seeking a...Show more
    Last updated: 30+ days ago • Promoted