Staff Engineer - SRE

Confidential, India

5 days ago

Job description

Urgent Hiring!!

Location : Remote

Role : Staff Engineer - SRE

Experience : 10+ years

The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability, stability and performance of systems and services.

They work with cross-functional teams to design, build and maintain systems, and they troubleshoot issues when they arise. They bridge the gap between development and operations teams.

They work closely with business teams to define Service Level Objectives (SLOs) and Service Level Agreements (SLAs) for critical systems. They also monitor and maintain the uptime of these systems in line with the defined SLOs and SLAs.

They deploy and manage monitoring tools to gain insight into system health and performance.

They analyze performance, identify bottlenecks and implement solutions to improve a system's scalability and latency.

They develop scripts and implement tools and automation frameworks to reduce manual intervention in deployment, monitoring and scaling.

They work with development teams to design and develop observability practices such as logging, metrics and tracing. They aim to diagnose and troubleshoot issues proactively.

They create actionable alerts on monitoring systems to ensure rapid response to potential production incidents.

They forecast resource needs and provision adequately for current and future demand.

They design and execute 'chaos experiments' to test systems' resilience to failure.

They own, define and implement the Disaster Recovery (DR) processes for systems. They also conduct planned and unplanned mock DR drills to test response preparedness for production incidents.

They ensure that security best practices are followed and implemented during the design and operation of systems.

They also own and maintain documentation of processes, playbooks, and systems.

They publish KPI reports and other system health updates to the business on a regular basis.

Requirements

Must-have - Bachelor's degree, preferably in CS or a related field, or equivalent experience

Must-have - 12+ years of overall IT experience

Must-have - 7+ years of proven work experience as a Senior Site Reliability Engineer or in a similar position.

Must-have - 5+ years of AWS Cloud experience, with an AWS certification such as Certified DevOps Engineer, SysOps, Security, etc.

Must-have - 3+ years of experience using a broad range of AWS technologies (e.g. EC2, RDS, ELB, S3, VPC, CloudWatch and monitoring tools) to develop and maintain an AWS-based cloud solution, with an emphasis on best-practice cloud security.

Must-have - 2+ years of experience with CDN and/or cache systems like Fastly, Akamai, CloudFront, etc.

Proven understanding of and strong experience with cloud deployments (AWS / Docker / Kubernetes)

Knowledge of IaC provisioning tools and scripting languages such as Terraform, Chef, Ansible, Shell, Groovy, Python, etc.

Experience with monitoring systems such as CloudWatch, New Relic, Datadog, Splunk and the ELK stack.

Experience managing cloud network resources (AWS preferred) such as CloudWatch, VPC, URL proxies, PrivateLink, DNS, ACLs, firewalls, and C2S access points.

Platform or application engineering and operational knowledge of CI/CD tooling such as GitHub Actions, Jenkins, etc.

Experience with other tools such as JIRA, Bitbucket, Jenkins, Fortify, SonarQube, Nexus and Nexus IQ

Experience with configuration automation tools like Puppet / Ansible / Chef / Salt

Scripting Skills : Strong scripting (e.g. Bash & Python) and automation skills.

Operating Systems : Windows and Linux system administration.

Problem Solving : Ability to analyze and resolve complex infrastructure resource and application deployment issues

Strong attention to detail. Excellent verbal and written communication skills. Strong documentation skills.

Good To Have

Experience with Terraform / Ansible / Chef / Puppet

Experience with GitHub Actions

Experience with CloudFront, Fastly

Oversees team members performing these functions

Anticipates problems and future technical needs and takes necessary steps to address issues.

Works primarily in server-side technologies and is comfortable with client-side work whenever required

Enthusiastically follows technology trends, software engineering best practices and technologies

Perks

Day off on the 3rd Friday of every month (one long weekend each month)

Monthly Wellness Reimbursement Program to promote health and well-being

Paid paternity and maternity leave

Notice Period : Immediate to 30 days

Email to : [HIDDEN TEXT]

Skills Required

New Relic, Chef, Fortify, ELK Stack, Bash, Datadog, Jira, Jenkins, CloudWatch, Docker, Bitbucket, Terraform, Ansible, SonarQube, Nexus, Splunk, Puppet, Python, Kubernetes, AWS
