Talent.com
Staff Engineer- SRE

Staff Engineer- SRE

ConfidentialIndia
5 days ago
Job description

Urgent Hiring!!

Location : Remote

Role : Staff Engineer- SRE

Experience : 10+

The Site Reliability Engineering (SRE) team is responsible for the reliability, scalability,

stability and performance of systems and services.

  • They work with cross-functional teams to design, build and maintain systems and they

troubleshoot issues when they arise. They bridge the gap between development and

operations teams.

  • They work closely with business teams to define Service Level Objectives
  • (SLO) and agreements (SLA) of critical systems. They also monitor and maintain the

    uptime of these systems in-line with the defined SLO's and SLA's.

  • They deploy and manage monitoring tools to gain insights on system health and
  • performance.

  • They analyze performance, identify bottlenecks and implement solutions to
  • improve a system's scalability and latency durations.

  • They develop scripts, implement tools and automation frameworks to reduce the manual
  • intervention efforts of deployment, monitoring and scaling.

  • They work with development teams for design and development of observability
  • practices like logging, metrics, tracing, etc. They aim to diagnose and troubleshoot issues

    proactively.

  • They create actionable alerts on monitoring systems to ensure rapid response for
  • potential production incidents.

  • They forecast resource needs and provision adequately for current and future demand.
  • They design and execute 'chaos experiments' to test system's failure resiliency.
  • They own, define and implement the Disaster Recovery (DR) processes for systems.
  • They also conduct planned and unplanned mock DR drills to test for response
  • preparedness during production incidents.

  • They ensure that security best practices are followed and implemented during design
  • and operations of systems.

  • They also own and maintain documentation of processes, playbooks, and systems.
  • They publish KPI reports and other system health updates on a regular basis to the
  • business.

    Requirements

  • Must-have - Bachelor's degree, preferably in CS or a related field, or equivalent
  • Experience

  • Must-have - 12+ years of overall IT experience
  • Must-have - 7+ year of proven work experience as a Senior Site Reliability Engineer or a
  • similar position.

  • Must-have - 5+ years of AWS Cloud experience with AWS Certified DevOps Engineer or
  • SysOps or Security etc.

  • Must-have - AWS experience - 3+ years' experience with using a broadrange of AWS
  • technologies (e.g. EC2, RDS, ELB, S3, VPC, CloudWatch & Monitoring Tools) to develop

    and maintain an Amazon AWS based cloud solution, with an emphasis on best practice

    cloud security.

  • Must-have - 2+ year of experience in CDN and / or Cache systems like Fastly, Akamai,
  • CloudFront, etc.

  • Proven Understanding & strong experience with Cloud deployments ( AWS / Docker /
  • Kubernetes)

  • Knowledge on provisioning IAC Tools like Terraform, Chef, Ansible, Shell, groovy,
  • python, etc.

  • Experience with monitoring systems such as CloudWatch, NewRelic, Datadog / Splunk,
  • ELK stack.

  • Experience managing cloud network resources (AWS Preferred) such as CloudWatch,
  • VPC, URL proxies, private link, DNS, ACLs, firewalls, and C2S access points.

  • Platform or Application Engineering and Operational Knowledge in any of the CI / CD
  • tooling like GitHub Actions, Jenkins, etc.

  • Experience in other tooling Technologies like JIRA, Bitbucket, Jenkins, Fortify,
  • SonarQube, Nexus, Nexus IQ

  • Experience with configuration automation tools like Puppet / Ansible / Chef / Salt
  • Scripting Skills : Strong scripting (e.g. Bash & Python) and automation skills.
  • Operating Systems : Windows and Linux system administration.
  • Problem Solving : Ability to analyze and resolve complex infrastructure resource and
  • application deployment issues

  • Strong attention to detail. Excellent verbal and written communication skills. Strong
  • documentation skills.

    Good To Have

  • Experience with Terraform / Ansible / Chef / Puppet
  • Experience with GitHub Actions
  • Experience with CloudFront, Fastly
  • Oversees team members performing these functions
  • Anticipates problems and future technical needs and takes necessary steps to address
  • issues.

  • Work primarily in server side technologies and comfortable with client side whenever
  • Required

  • Enthusiastically follow technology trends, software engineering best practices and
  • technologies

    Perks

  • Day off on the 3rd Friday of every month (one long weekend each month)
  • Monthly Wellness Reimbursement Program to promote health well-being
  • Paid paternity and maternity leaves
  • Notice Period : Immediate- 30 Days

    Email to : [HIDDEN TEXT]

    Skills Required

    Newrelic, Chef, Fortify, Elk Stack, Bash, Datadog, Jira, Jenkins, Cloudwatch, Docker, Bitbucket, Terraform, Ansible, Sonarqube, Nexus, Splunk, Puppet, Python, Kubernetes, Aws

    Create a job alert for this search

    Staff Engineer • India

    Related jobs
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.Nagpur, IN
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Software Engineer

    Staff Software Engineer

    First American (India)Nagpur, IN
    The Staff Engineer is a senior technical leader responsible for setting engineering direction, delivering resilient platforms, and elevating engineering excellence across squads.You will drive high...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Engineer

    Staff Engineer

    Workfabric AIRepublic Of India, IN
    We are seeking an experienced Staff Engineer to lead the architecture, design, and large scale deployment of the ContextSensor, a core component of the ContextFabric platform.The ContextSensor powe...Show moreLast updated: 22 days ago
    • Promoted
    Staff Engineer

    Staff Engineer

    ConfidentialIndia
    ApplyBoard simplifies the study abroad search, application, and acceptance process by connecting international students, recruitment partners, and educational institutions on one intuitive and pers...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Staff Engineer

    Senior Staff Engineer

    ChargebeeChennai, Republic Of India, IN
    Chargebee is looking for an inspirational Senior Staff Engineer for driving the Next Generation of Subscription to create a revolutionary subscriptions experience for its customers.In this role, yo...Show moreLast updated: 1 day ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeNagpur, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 14 days ago
    • Promoted
    DevSecOps / AppSecOps Staff Engineer

    DevSecOps / AppSecOps Staff Engineer

    First American (India)Nagpur, IN
    Our people-first culture empowers bold thinkers and passionate technologists to solve real-world challenges through scalable architecture and innovative design. If you're driven by impact, thrive in...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Full Stack Engineer

    Sr Full Stack Engineer

    Mitchell Martin Inc.Nagpur, IN
    We’re looking for a Senior Full Stack Software Engineer who’s passionate about clean code, scalable architecture, and continuous improvement. You’ll collaborate across teams to design, develop, and ...Show moreLast updated: 2 days ago
    • Promoted
    Staff Engineer

    Staff Engineer

    OnArrivalnagpur, India
    OnArrival is redefining the travel tech industry by building the world’s most advanced full-stack travel platform.We provide seamless, intelligent travel infrastructure, powering everything from fl...Show moreLast updated: 16 days ago
    • Promoted
    Staff Software Engineer

    Staff Software Engineer

    Andalusia LabsNagpur, IN
    At Andalusia Labs, we build foundational economic infrastructure for programmable global markets, connecting capital, computation, and coordination across the internet. Our work sits at the intersec...Show moreLast updated: 1 day ago
    • Promoted
    Sr. Full Stack Engineer

    Sr. Full Stack Engineer

    BrightEdgeNagpur, IN
    BrightEdge is a global leader in enterprise SEO and content performance solutions, driving AI-powered digital marketing success for the world’s top brands. Our culture is product-first, innovation-d...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Engineer

    Staff Engineer

    Talent et au-delaPune, Republic Of India, IN
    Staff Engineer (Software Development).Core Technical Product Development Background).Location : Mumbai / Pune / Gurgaon / Noida. As Staff Engineer you will be Leading the application development with...Show moreLast updated: 2 days ago
    • Promoted
    Deployment Engineer

    Deployment Engineer

    AvocaNagpur, IN
    Build, launch & optimize AI agents that power the next generation of home-service customer experiences.Avoca is the all-in-one AI lead-conversion platform. Our technology boosts booking rates, slash...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Machine Learning Engineer

    Staff Machine Learning Engineer

    OcrolusNagpur, IN
    Come build at the intersection of AI and fintech.At Ocrolus, we’re on a mission to help lenders automate workflows with confidence—streamlining how financial institutions evaluate borrowers and ena...Show moreLast updated: 1 day ago
    • Promoted
    Staff Site Reliability Engineer

    Staff Site Reliability Engineer

    PoshmarkChennai, Republic Of India, IN
    We’re looking for an experienced.You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying ...Show moreLast updated: 14 days ago
    • Promoted
    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Senior Site Reliability Engineer (SRE) – Datadog Observability

    Jade Globalnagpur, maharashtra, in
    Senior Site Reliability Engineer (SRE) – Datadog Observability.SRE and Infrastructure Operations with minimum 3.Hyderabad preferable but open for Pune and remote. Site Reliability Engineer (SRE).SRE...Show moreLast updated: 2 days ago
    • Promoted
    Sr. / Software Engineer

    Sr. / Software Engineer

    BrightEdgeNagpur, IN
    BrightEdge is a global leader in enterprise SEO and content performance solutions, driving AI-powered digital marketing success for the world’s top brands. Our culture is product-first, innovation-d...Show moreLast updated: 30+ days ago
    • Promoted
    Staff Engineer Agentic [T500-21157]

    Staff Engineer Agentic [T500-21157]

    ANSRnagpur, maharashtra, in
    About Albertsons Companies Inc.As a leading food and drug retailer in the United States, Albertsons Companies, Inc.Our well-known banners across the United States, including Albertsons, Safeway, Vo...Show moreLast updated: 2 days ago