Talent.com
Site Reliability Engineer II [Immediate Start]

RecRoots, India
7 hours ago
Job description

Key Job Responsibilities and Duties:

The core premise of SRE is to treat operational issues as software problems. Where operations are concerned, we code our way out of problems, addressing availability, scalability, latency, and efficiency challenges across our vast infrastructure.

  • You will impact millions of people all over the globe with your creative solutions
  • You will work at one of the biggest e-commerce companies in the world
  • You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers
  • You will have the opportunity to collaborate with many of the world’s leading SREs
  • You will be free to launch your own ideas and solutions within our sophisticated production environment
  • Some of the tools and technologies we use to achieve this: Python, Go, Puppet, Kubernetes, Elasticsearch, Prometheus, HAProxy, Cassandra, Kafka, and more

What you’ll be doing:

  • Design, develop and implement systems software that improves the stability, scalability, availability and latency of the products;
  • Take ownership of one or more services and have the freedom to do what is best for our business and customers;
  • Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again;
  • Build effective monitoring to track the health of your systems, and jump in to handle outages;
  • Build and run capacity tests to handle the growth of your systems;
  • Plan for reliability by designing systems to work across our multinational data centers;
  • Develop tools that help the product development teams successfully deploy thousands of change sets every day;
  • Share the on-call rotation and be an escalation contact for incidents (depending on the level of the role).

What you’ll bring:

  • Solid experience in at least one programming language;
  • Experience with building, operating and maintaining scalable distributed systems, and with operations automation;
  • Experience with Infrastructure as Code technologies;
  • Knowledge of cloud computing fundamentals;
  • Solid foundation in Linux administration and troubleshooting;
  • Understanding of service level agreements (SLAs) and service level objectives (SLOs);
  • Additional experience in OpenStack, Kubernetes, Networking, Security or Storage is desirable;
  • Experience with monitoring / observability technologies such as Prometheus, Graphite, Grafana, Kibana, or Elasticsearch is a plus;
  • Good interpersonal skills;
  • Proficient command of the English language, both written and spoken.