Site Reliability Engineering Manager

People Hire ConsultingKollam, Republic Of India, IN

2 hours ago

Job description

Looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure

stability, reliability and performance and rapid deployments of our platform. We build teams that

are inclusive, collaborative, and have a strong sense of ownership for the things they build. If you

have a passion and track record for solving problems;

moreover, have strong leadership skills, this is a great fit for you.

As Manager, SRE you will demonstrate both emerging and current technologies, methods, and

processes contributing to the evolution of software deployment processes, enhancing security,

reducing risk, and improving the overall end-user experience. As part of the Technology R&D Team, you will play an integral part in advancing DevOps maturity and be a part of a new culture of quality and site reliability. You will continually improve our CI / CD tools, processes, and procedures. You will also be responsible for regular reporting to Senior Technology Leaders and providing updates on organizational risk exposure and risk related issues.

What You Will Be Doing :

Set the direction and strategy for your team, and help shape the overall SRE program for the

company

Support the growth by ensuring a robust, scalable, cloud-first infrastructure

Own site stability, performance and capacity planning

Participate early in the SDLC to ensure reliability is built in from the beginning, and creating

plans for successful implementations / launches

Foster a learning and ownership culture within the team and the larger organization

Ensure best engineering practices through automation, infrastructure as code, robust system

monitoring, alerting, auto scaling, self-healing, etc...

Manage complex technical projects and a team of SREs

Recruit and develop staff;

build a culture of excellence in site reliability and automation

Lead by example – roll up your sleeves by debugging and coding;

participate in on-call rotation

& occasional travel

Represent the technology perspective and priorities to leadership and other stakeholders by

continuously communicating timeline, scope, risks, and technical road map

What You Will Need for this Position :

10+ years of hands-on technical leadership and people management experience

3+ years of demonstrable experience leading site reliability and performance in large-scale,

high-traffic environments

Strong leadership, communication and interpersonal skills geared to getting things done

Developing themselves and the talent within their charge – fostering and creating

opportunity for the team

Architect-level understanding of one or more of the major public cloud services (AWS, GCP or

Azure), using them to effectively design secure and scalable services

Strong understanding of SRE concepts and the DevOps culture, with a focus on leveraging

software engineering tools, methodologies and concepts

In-depth understanding of automation and CI / CD processes to go along with excellent

reasoning and problem-solving skills

Experience with Unix / Linux environments with a deep grasp on system internals

Worked on large-scale distributed systems including multi-tiered architecture

Strong knowledge of modern platforms like Fargate, Docker, Kubernetes etc.

Experience working with monitoring tools (Datadog, NewRelic, ELK stack, etc) and Database

technologies (SQL Server, Postgres and Couchbase preferred)

Validated breadth of understanding and development of solutions based on multiple

technologies, including networking, cloud, database, and scripting languages.

Experience in prompt engineering, building AI Agents, or MCP is a plus.

Create a job alert for this search

Engineering Manager • Kollam, Republic Of India, IN

Related jobs

Promoted

Equifax - Senior Site Reliability Engineer - IAC Terraform

EquifaxTrivandrum

About the job Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distr...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer

CapgeminiKollam, IN

Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 16 days ago

Promoted
New!

Site Reliability Engineering Manager

People Hire ConsultingAlleppey, Republic Of India, IN

Looking for a Manager, Site Reliability Engineering to help us scale our systems and ensure.As Manager, SRE you will demonstrate both emerging and current technologies, methods, and.As part of the ...Show moreLast updated: 2 hours ago

Promoted

Equifax - Site Reliability Engineer

EquifaxThiruvananthapuram

Site Reliability Engineering (SRE) at Equifax SRE is a discipline that combines software and systems engineering for building and running large-scale, distrib...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer II

ConfidentialThiruvananthapuram, Thiruvananthapuram / Trivandrum, India

The world's top banks use Zafin's integrated platform to drive transformative customer value.Powered by an innovative AI-powered architecture, Zafin's platform seamlessly unifies data from across t...Show moreLast updated: 10 days ago

Promoted

Engineering Manager

TamaraThiruvananthapuram, IN

Tamara is the leading fintech platform in Saudi Arabia and the wider GCC region with a mission to help people make their dreams come true by building the most customer-centric financial super-app o...Show moreLast updated: 30+ days ago

Promoted

Senior Site Reliability Engineer

IntraEdgeThiruvananthapuram, IN

Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 19 days ago

Promoted

Senior Site Reliability Engineer

ConfidentialThiruvananthapuram / Trivandrum, India

Site Reliability Engineering (SRE).Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems.SRE ensures that ...Show moreLast updated: 10 days ago

Promoted

Site Reliability Engineer (SRE)

ConfidentialThiruvananthapuram / Trivandrum

As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability.Should be able to gathe...Show moreLast updated: 30+ days ago

Promoted

Senior Site Reliability Engineer- ELK Expert

iVedha Inc.Alappuzha, IN

Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago

Promoted

Site Reliability Engineer (SRE) – Infrastructure & Automation

InstaServiceThiruvananthapuram, IN

InstaService is revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly. We’re growing fast across 23+ states and expanding...Show moreLast updated: 4 days ago

Promoted

Site Reliability Engineer

CitNOW GroupAlappuzha, IN

Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-based plat...Show moreLast updated: 5 days ago

Promoted
New!

Site Reliability Engineer

Synamediakollam, kerala, in

At Synamedia, the world’s most talented innovators and trailblazers are shaping the way the world is entertained and informed. We are backed by the Permira funds and Sky.This is the age of infinite ...Show moreLast updated: 13 hours ago

Promoted

Sr Engineer, Site Reliability [T500-21295]

TMUS Global Solutionsthiruvananthapuram, kerala, in

NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 5 days ago

Promoted

Sr Engineer, Site Reliability T500-21295

TMUS Global SolutionsAlleppey, Republic Of India, IN

Promoted

Site Reliability Engineer

CodeKarmathiruvananthapuram, kerala, in

Site Reliability Engineer (Multi-Cloud Deployments).CodeKarma is redefining how engineering teams understand and evolve complex systems — bringing production context directly into the developer’s w...Show moreLast updated: 27 days ago

Promoted

Senior Site Reliability Engineer

Nebula Tech Solutionsalappuzha, India

SRE team supporting mission-critical applications for our.We’re now looking for engineers who can go beyond operations — those who can. Enhance application reliability through code.Add or modify cod...Show moreLast updated: 6 days ago

Promoted

Senior Site Reliability Engineer- Elk Expert

iVedha Inc.Thiruvananthapuram, Republic Of India, IN