Site Reliability Lead

Confidential • Hyderabad / Secunderabad, Telangana
12 days ago
Job description

We are looking for a Team Lead who is eager to build in a fast-paced, startup environment inside a stable, profitable company. Our teams are solving complex problems that impact the speed and effectiveness of the life sciences industry. In this role, you'll jump right in, develop in rapid sprints, and quickly find that we don't believe in throwaway technology. You build it - we ship it. You have extensive experience in Java applications and the latest open-source technologies. Ideal candidates have worked in enterprise software development or for a high-growth technology company.

Sr. Software Engineers on the Vault Site Reliability team at Veeva are innately curious and have a penchant for problem-solving. The scale at which you will be working supports hundreds of customers across North America, Europe, and Asia. Experience in enterprise software development and the Java stack will make you successful in this role. You bring a unique engineering perspective to development as the expert in how all of the related systems and applications come together in production. You know what will work at scale.

What You'll Do

  • Lead a team of engineers, mentor them, and provide onsite leadership.
  • Rapidly build new applications on an existing, robust enterprise platform.
  • Build new cloud infrastructure from scratch, following best practices in software development.
  • Drive new features and improvements in a fast-changing environment.
  • Partner with product management, design, and QA to deliver cutting-edge solutions and direct value to our customers.
  • Work on multiple layers of our stack, including backend (primary), front-end, and infrastructure.
  • Build tools and automation that eliminate work and reduce the time it takes to resolve an issue.
  • You want to make the system better every day and are self-driven to learn all that is necessary to provide full-stack diagnostics and determine the root cause of problems.
  • Ensure our platform meets the scalability and reliability needs of our customers.
  • During an incident, lead the effort to triage and mitigate.
  • You might need to perform periodic on-call duty if issues are escalated.
  • Strategize with engineering teams on complex problems.
  • You know how to support a system that is used by 3M users and can advise dev teams, before a feature ships, on what will work in production.
  • Participate in engineering design reviews of new features.
  • Drive focused initiatives that improve operational efficiency and scalability of the platform.
  • Communicate effectively with engineering teams, describing problems succinctly but with enough detail that you can hand off an ongoing problem to another team or a peer for completion.
  • Engage in real-time communication during outages with both technical and non-technical audiences.

Requirements

  • 8+ years of experience in Java, preferably at an enterprise cloud software company.
  • Proven ability to write clean, testable, readable code in a team environment.
  • Hands-on experience with open-source technologies, such as Spring, MySQL, Hibernate, Solr, Maven, Git, Tomcat, Linux, AWS, Vagrant, Docker, Kubernetes.
  • 3+ years of experience in relational databases with a mastery of SQL.
  • Demonstrated history of incident management and leadership ability.
  • Experience in handling production outages and root-cause analysis.
  • Hands-on operational experience in a high-volume or critical production service environment.
  • Effective communication skills across all levels - whether talking to individual contributors or executives.
  • Solid scripting skills; experience with Shell, Bash, Ansible, Python, Go, Ruby, etc.
  • Ability to handle periodic on-call duty.
  • Fluent in English, both written and verbal.
  • A strong mentor with a proven record of making your team better.

Skills Required

Incident Management, Shell Scripting, Relational Databases, SQL, Java, Python
