Talent.com
No longer accepting applications
Principal Site Reliability Enginee

Principal Site Reliability Enginee

ConfidentialHyderabad / Secunderabad, Telangana, Chennai, Bengaluru / Bangalore
30+ days ago
Job description

As a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and enhancements across various areas of Boomi offerings.

You will :

  • Participate actively in detecting, remediating and reporting on Production incidents, ensuring the SLAs / SLOs are defined and met
  • Participate in on-call rotation to ensure coverage for planned / unplanned events.
  • Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
  • Working with your SRE and Engineering counterparts for driving DR exercises, Game days, training and other response readiness efforts.
  • Collaborate with Service Engineering organizations to build and automate tooling, implement best practices on Observability and manage the Boomi services in production and consistently achieve our market leading SLA.
  • Improving the scalability and reliability of Boomi s systems in production.
  • Automate the provisioning and maintenance of Boomi s infrastructure.
  • Work independently with a minimal level of guidance from technical leadership
  • Mentor other Boomi engineers, including design collaboration and code reviews

Take the first step towards your dream career with Boomi

Essential Requirements

  • Passionate about SRE, DevOps, Automation and infrastructure platforms. Expert in developing Ansible playbooks and automation for Infrastructure as code using Terraform and Cloud Formation Templates.
  • Expert in defining, measuring, and improving Reliability Metrics (SLO / SLI / Error budgets)
  • Strong in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.) preferably using Splunk and New Relic.  Experience not limited to using the dashboards, but creating them from scratch.
  • Experience in conducting and automating DR exercise in AWS cloud thus validating RPOs and RTOs.
  • Strong understanding and working experience with AWS components.
  • Ability to design and implement API s for use by internal teams.
  • Desirable Requirements

  • 6 to 8 years of related experience in the software engineering industry, with experience supporting large scale software systems in production.
  • Certified in Cloud (AWS / Azure / GCP), experience in using services such as computers, containers and databases.
  • Experience in Ansible / Terraform and Python.
  • A grasp of Cloud Native concepts, containerization best practices and security awareness in Cloud will be a strong plus.
  • Experience in Observability, creating dashboards for SLA / SLI / SLO
  • Role :   Site Reliability Engineer

    Industry Type :   IT Services & Consulting

    Department :   Engineering - Software & QA

    Employment Type :   Full Time, Permanent

    Role Category :   DevOps

    Education

    UG :   Any Graduate

    PG :   Any Postgraduate

    Skills Required

    Rtos, Product Management, Gcp, Service Engineering, Boomi, Splunk, Http, Automation, Python, Monitoring

    Create a job alert for this search

    Principal Reliability • Hyderabad / Secunderabad, Telangana, Chennai, Bengaluru / Bangalore

    Related jobs
    • Promoted
    Engineer, Site Reliability [T500-20266]

    Engineer, Site Reliability [T500-20266]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Sr Engineer, Site Reliability Engineer [T500-20464]

    Sr Engineer, Site Reliability Engineer [T500-20464]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer, Site Reliability [T500-20521]

    Engineer, Site Reliability [T500-20521]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    AutoRABITHyderabad, Republic Of India, IN
    AutoRABIT is the leader in DevSecOps for SaaS platforms such as Salesforce.Its unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, ...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    IntraEdgeHyderabad, IN
    Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Strategic thinking with a focus on long-term operational excellence.Champion operation...Show moreLast updated: 20 days ago
    • Promoted
    Engineer, Site Reliability [T500-20502]

    Engineer, Site Reliability [T500-20502]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer T500-21132

    Site Reliability Engineer T500-21132

    InspireHyderabad, Republic Of India, IN
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    NationsBenefits IndiaHyderabad, Telangana, India
    Site Reliability Engineer (SRE) | Fintech | Kubernetes | Datadog |.SRE team focused on maintaining the performance, reliability, and availability of our fintech platforms.Triage and resolve product...Show moreLast updated: 28 days ago
    • Promoted
    Engineer, Site Reliability [T500-20517]

    Engineer, Site Reliability [T500-20517]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Engineer, Site Reliability [T500-20515]

    Engineer, Site Reliability [T500-20515]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    FACTSETHyderabad, India
    FactSet creates flexible, open data and software solutions for over 200,000 investment professionals worldwide, providing instant access to financial data and analytics that investors use to make c...Show moreLast updated: 22 days ago
    • Promoted
    Principal Engineer, Site Reliability T500-20295

    Principal Engineer, Site Reliability T500-20295

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CapgeminiHyderabad, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show moreLast updated: 17 days ago
    • Promoted
    Site Reliability Engineer [T500-21132]

    Site Reliability Engineer [T500-21132]

    Inspirehyderabad, telangana, in
    Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies.The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will l...Show moreLast updated: 7 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    InfosysHyderabad, Republic Of India, IN
    We are seeking a skilled and motivated Site Reliability Engineer with hands-on expertise.DevOps tools, and SRE principles. Provide production support for Production applications, ensuring the stabil...Show moreLast updated: 20 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    CitNOW GroupHyderabad, Telangana, India
    About us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably.CitNOW’s app-...Show moreLast updated: 5 days ago
    • Promoted
    Principal Engineer, Site Reliability [T500-20295]

    Principal Engineer, Site Reliability [T500-20295]

    TMUS Global SolutionsHyderabad, Telangana, India
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago
    • Promoted
    Principal Site Reliability Engineer

    Principal Site Reliability Engineer

    TMUS Global SolutionsHyderabad, Republic Of India, IN
    NASDAQ : TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mo...Show moreLast updated: 30+ days ago