Talent.com
Site Reliability Engineer
Site Reliability EngineerCitNOW Group • Hyderabad, Telangana, India
No longer accepting applications
Site Reliability Engineer

Site Reliability Engineer

CitNOW Group • Hyderabad, Telangana, India
9 days ago
Job description

About us

Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships.

CitNOW Group was formed in 2021 to unite a portfolio of 12 global software companies leveraging innovation to aid retailers and manufacturers in delivering an outstanding customer experience. We have over 300 employees worldwide who all contribute to our vision to provide market-leading automotive solutions to drive efficiencies, seamlessly transforming every customer moment.

The CitNOW Group is no ordinary technology company, we live a series of One Team values and this guiding principle forms the foundation of CitNOW Group’s award winning, collaborative and inclusive culture. Recognised recently within the Top 25 Best Mid Sized Companies to work for within the UK, we pride ourselves on being a great place to work.

About the role

We are looking for a proactive and experienced Site Reliability Engineer (SRE) to join our Engineering team remotely in India. The ideal candidate will have deep expertise in cloud operations, automation, monitoring, and reliability engineering, with hands-on experience managing a wide range of SaaS and infrastructure tools. The role focuses on ensuring system uptime, performance, and scalability across our global platform.

Key responsibilities :

Reliability & Infrastructure Management

Design, implement, and manage scalable cloud infrastructure on Google Cloud (GCP) and AWS

Manage integrations and operations across third-party platforms including Mongo Atlas, Cloudflare, Stripe, Cledara, Datadog, Atlassian Status page, Semaphore, Postmark, SendGrid, Lokalise, Zendesk (Smooch & Smooch EU), Twilio, Mailgun, Facebook, Google Workspace, Asana, GitHub, Ngrok, npm, Readme, Loom, Deepgram, and OpenAI

Implement Infrastructure as Code (IaC) using tools like Terraform or Ansible to automate provisioning and scaling

Ensure systems adhere to security, compliance, and reliability best practices

Monitoring, Alerting & Incident Management

Build and maintain observability solutions using Datadog, GCP Logging, and related tools for monitoring system health, latency, and performance

Define and manage SLOs, SLIs, and SLAs to measure and maintain reliability

Implement proactive alerting, diagnostics, and runbooks for efficient incident response

Participate in on-call rotations and lead root cause analyses (RCA) for post-incident reviews

Automation & CI / CD

Design and optimize CI / CD pipelines using Semaphore CI / CD, GitHub Actions, or similar tools

Develop automation scripts and utilities in Python, Bash, or equivalent scripting languages to streamline operations and reduce manual interventions

Integrate and automate workflows between systems such as Asana, Github, and Google Workspace for operational efficiency

Security & Governance

Manage identity and access controls across cloud services and third-party SaaS platforms

Implement best practices for secrets management, data protection, and compliance with privacy standards

Collaboration & Continuous Improvement

Partner closely with developers to design resilient, high-performing services.

Promote an SRE culture focused on continuous learning, blameless postmortems, and process improvement.

Maintain up-to-date operational documentation, playbooks, and architectural diagrams.

We are looking for :

Bachelor’s degree in computer science, Engineering, or related field

4+ years of experience in Site Reliability Engineering, DevOps, or Cloud Operations

Strong experience with Google Cloud Platform (GCP), Amazone Web Services (AWS) and Mongo Atlas

Proven ability to manage and integrate multiple SaaS and developer tools (Datadog, Cloudflare, Atlassian Status page, Semaphore, SendGrid, etc.)

Hands-on experience with CI / CD pipelines, Terraform, GitHub Actions, and containerized environments (Docker, GCP Cloud Run, or Kubernetes)

Expertise in monitoring, incident response, and system optimization

Excellent troubleshooting, documentation, and communication skills

Strong collaboration mindset aligned with cross-functional development and operations teams

In addition to a competitive salary, our benefits package is second to none. Employee wellbeing is at the heart of our people strategy, with a number of innovative wellness initiatives such as flexi-time, where employees can vary their start and finish times within our core business hours and / or extend their lunch break by up to 2 hours per day. Employees also benefit from an additional two half days paid leave per year to focus on their personal wellbeing.

We recognise the development of our people is vital to the ongoing success of the business and proudly promote a culture of continuous learning and improvement, along with opportunities to develop and progress a successful career with us.

The CitNOW Group is an equal opportunities employer that celebrates diversity across our international teams. We are passionate about creating an inclusive workplace where everyone’s individuality is valued.

View our candidate privacy policy here - CitNOW-Group-Candidate-Privacy-Policy.pdf (citnowgroup.com)

Create a job alert for this search

Site Reliability Engineer • Hyderabad, Telangana, India

Related jobs
Sr Engineer, Site Reliability Engineer

Sr Engineer, Site Reliability Engineer

TMUS Global Solutions • Hyderabad, India
The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise ...Show more
Last updated: 30+ days ago • Promoted
Engineer, Site Reliability [T500-20517]

Engineer, Site Reliability [T500-20517]

TMUS Global Solutions • Hyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
Last updated: 30+ days ago • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

IntraEdge • Hyderabad, IN
Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show more
Last updated: 23 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Tata Consultancy Services • Hyderabad, Telangana, India
GKE(Preferable); Kubernetes (Any cloud) + PostgresSQL, SQL(Must).Linux (Optional), Java (Optional) , Kubernetes (CLI), Prior Production support experience, Release Management, Prior Deployment expe...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Inspire Brands Hyderabad Support Center • Hyderabad, India
Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The companys technology hub, Inspire Brands Hyderabad Support Center, India, will le...Show more
Last updated: 19 days ago • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

AutoRABIT • Hyderabad, Telangana, India
AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show more
Last updated: 30+ days ago • Promoted
Sr Engineer, Site Reliability

Sr Engineer, Site Reliability

TMUS Global Solutions • Hyderabad, India
The Senior Systems Reliability Engineer (SRE) ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise ...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Synamedia • secunderabad, telangana, in
At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show more
Last updated: 4 days ago • Promoted
Site Reliability Engineer [T500-21132]

Site Reliability Engineer [T500-21132]

Inspire • Hyderabad, Telangana, India
Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show more
Last updated: 10 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Capgemini • Hyderabad, IN
Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
Last updated: 20 days ago • Promoted
Engineer - Site Relibility - FPT

Engineer - Site Relibility - FPT

Talent500 INC • Hyderabad, India
Engineer - Site Reliability - FPT.As a Site Reliability Engineer, youll play a crucial role in keeping our digital backbone running seamlessly for millions of customers. Your mission : reduce inciden...Show more
Last updated: 30+ days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

HRhelpdesk • hyderabad, telangana, in
Company is a rapidly growing, private equity backed SaaS product company and provides cloud-based solutions.As a Site Reliability Engineer (SRE), you will be responsible for building and maintainin...Show more
Last updated: 19 hours ago • Promoted • New!
Senior Site Reliability Engineer

Senior Site Reliability Engineer

o9 Solutions, Inc. • secunderabad, telangana, in
Be part of something revolutionary.At o9 Solutions, our mission is clear : be the Most Valuable Platform (MVP) for enterprises. With our AI-driven platform — the o9 Digital Brain — we integrate globa...Show more
Last updated: 1 day ago • Promoted
Senior Site Reliability Engineer

Senior Site Reliability Engineer

Nebula Tech Solutions • hyderabad, telangana, in
SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show more
Last updated: 11 days ago • Promoted
Engineer, Site Reliability [T500-20515]

Engineer, Site Reliability [T500-20515]

TMUS Global Solutions • Hyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
Last updated: 30+ days ago • Promoted
Engineer, Site Reliability [T500-20266]

Engineer, Site Reliability [T500-20266]

TMUS Global Solutions • Hyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
Last updated: 30+ days ago • Promoted
Engineer, Site Reliability

Engineer, Site Reliability

TMUS Global Solutions • Hyderabad, India
Engineer reliability : Identify potential system issues early, implement preventive measures, and boost system resilience. Automate for speed : Build tools, pipelines, and scripts that eliminate manua...Show more
Last updated: 30+ days ago • Promoted
Engineer, Site Reliability [T500-20519]

Engineer, Site Reliability [T500-20519]

TMUS Global Solutions • Hyderabad, Telangana, India
NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show more
Last updated: 30+ days ago • Promoted