Talent.com
This job offer is not available in your country.
Site Reliability Engineering Manager

Site Reliability Engineering Manager

EpsilonIndia
10 days ago
Job description

About Business Unit :

SaaSOps leads post-production support and the overall experience of Epsilon PeopleCloud products for our global clients. This function is responsible for product support, incident management, managed operations and the automation of processes. The team has successfully incubated and mainstreamed Site Reliability Engineering (SRE) as a practice, to ensure reliable product operations on a global scale. Plus, the team is actively leading the adoption of AI in operations (AIOps) and recently launched AI-driven self-service capabilities to enhance operational efficiency and improve client experiences.

Click here to view how Epsilon transforms marketing with 1 View, 1 Vision and 1 Voice.

Responsibilities

Will be a senior IC role responsible for driving strong operations engineering practices in SaaS product operations.

Role will be working closely with engineering, delivery and operations team to ensure streamlined release and change management processes

Role will be closely working with product operations team to deep dive and identify root cause of production issues and work with concerned teams to come up with a permanent fix to recurring issues

Role will identify automation opportunities to streamline repeat tasks.

Will contribute to evolution of AIOps strategy - identify use cases and come up with AI / Agentic autonomous solutions

Qualifications

15+ Years of candidates in SRE

The candidate will be hands-on technology leader with a proven experience working as a SRE leader in a product set up.

The ideal candidate should have a strong full stack engineering background with Cloud & AI / Gen AI experience

Must have strong development skills - at least two of Python, Java, C#; strong DB skills (RDBMS, NoSql, Cloud DBs), Container / orchestration, Cloud Infrastructure

Super proficient in atleast one hyperscaler cloud (AWS, GCP, Azure)

Demonstrated real world experience in traditional ML & Gen AI use case deployments in production

Candidate should have had experience in working closely with Engineering & Operations team - must have a strong DevOps, Release management, change management experience

Experience in AIOps will be an added advantage.

Must have proven skills in collaboration and getting things done

Epsilon is a global data, technology and services company that powers the marketing and advertising ecosystem. For decades, we’ve provided marketers from the world’s leading brands the data, technology and services they need to engage consumers with 1 View, 1 Vision and 1 Voice. 1 View of their universe of potential buyers. 1 Vision for engaging each individual. And 1 Voice to harmonize engagement across paid, owned and earned channels.

Epsilon’s comprehensive portfolio of capabilities across our suite of digital media, messaging and loyalty solutions bridge the divide between marketing and advertising technology. We process 400+ billion consumer actions every single day using advanced AI and hold many patents of proprietary technology, including real-time modeling languages and consumer privacy advancements. Thanks to the work of every employee, Epsilon has been consistently recognized as industry-leading by Forrester, Adweek and the MRC. Epsilon is a global company with more than 9,000 employees around the world.

Epsilon has a core set of 5 values that define our culture and guide us to bring value for our clients, our people and consumers. We are seeking candidates that align with our values, demonstrate them and make them meaningful in their day-to-day work :

Additional Information

Act with integrity . We are transparent and have the courage to do the right thing.

Work together to win together . We believe collaboration is the catalyst that unlocks our full potential.

Innovate with purpose . We shape the market with big ideas that drive big outcomes.

Respect all voices . We embrace differences and foster a culture of connection and belonging.

Empower with accountability . We trust each other to own and deliver on common goals.

Because You Matter

YOUniverse. A work-world with you at the heart of it!

At Epsilon, we believe people make the place. And everything we do is designed with you in mind. That’s why our work-world, aptly named ‘YOUniverse’ is passionate about crafting a nurturing environment that elevates your growth, wellbeing and work-life harmony. So, come be part of a people-centric workspace where care for you is at the core of all we do.

Take a trip to YOUniverse and explore our outstanding benefits, here

Epsilon is an Equal Opportunity Employer.

Epsilon is committed to promoting diversity, inclusion, and equal employment opportunities by using reasonable efforts to attract, recruit, engage and retain qualified individuals of all ethnicities and backgrounds, including, but not limited to, women, people of color, LGBTQ individuals, people with disabilities and any other underrepresented groups, traits or characteristics.

Create a job alert for this search

Engineering Manager • India

Related jobs
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ExasoftIndia, India
Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 2 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

BirlasoftIndia
Be primarily responsible for providing production, operations support and application administration to business and web applications, 3rd party applications and related ecosystems.The application ...Show moreLast updated: 27 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Insight GlobalIndia
USD Must be able to join within 30 days or less! Job Description : An employer is looking for an SRE to join their enterprise level SRE team. They are building a specialized team of Senior Site Relia...Show moreLast updated: 30+ days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

UplersNagpur, IN
Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 27 days ago
  • Promoted
Manager, Site Reliability Engineering (Cortex XDR XSIAM)

Manager, Site Reliability Engineering (Cortex XDR XSIAM)

Palo Alto NetworksIndia
At Palo Alto Networks® everything starts and ends with our mission : .Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day is safer and m...Show moreLast updated: 7 days ago
  • Promoted
Engineer, Site Reliability [T500-20515]

Engineer, Site Reliability [T500-20515]

ANSRIndia
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 9 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ViewSonicIndia
Job Requirements : Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding o...Show moreLast updated: 20 days ago
  • Promoted
Principal Site Reliability Engineer

Principal Site Reliability Engineer

Rakuten IndiaIndia
Responsibilities : Design, develop SLA, SLO, SLI of services within the Business Unit.Involve in whole process of Development, Production System Operation including system maintenance, monitoring, a...Show moreLast updated: 20 days ago
  • Promoted
Site Reliability Engineering Manager

Site Reliability Engineering Manager

TechBlocksIndia
Job Title : Site Reliability Engineering (SRE) Manager.Work Model - 3 Days from office (Hybrid).The SRE Manager at TechBlocks India will lead the reliability engineering function, ensuring infrastru...Show moreLast updated: 23 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

XebiaNagpur, IN
AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 29 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

Luxoft IndiaIndia
We are looking for an experienced technical developer to work for one of our client from the banking industry.Project goal is to maintain and develop solutions. Design, develop, and improve the digi...Show moreLast updated: 20 days ago
  • Promoted
Engineer, Site Reliability [T500-20518]

Engineer, Site Reliability [T500-20518]

ANSRIndia
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 10 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

TavantIndia
With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 28 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

BayOne Solutionsnagpur, maharashtra, in
Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 2 days ago
  • Promoted
Senior Site Reliability Engineer- ELK Expert

Senior Site Reliability Engineer- ELK Expert

iVedha Inc.nagpur, maharashtra, in
Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
  • Promoted
Engineer, Site Reliability [T500-20504]

Engineer, Site Reliability [T500-20504]

ANSRIndia
ANSR is hiring for one of its clients.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flags...Show moreLast updated: 10 days ago
  • Promoted
Engineer, Site Reliability [T500-20266]

Engineer, Site Reliability [T500-20266]

ANSRIndia
ANSR is hiring for one of its clients.About T-Mobile : T-Mobile US, Inc.NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its st...Show moreLast updated: 19 days ago
  • Promoted
Site Reliability Engineer

Site Reliability Engineer

ConcordNagpur, IN
Engineers (Individual Contributors).Strong SRE (Site Reliability Engineering).CI / CD, monitoring, automation, infrastructure as code, etc.Show moreLast updated: 20 days ago