Monitoring Engineer - Site Reliability

Insight Global, LLC - Bangalore
9 days ago
Job description

Job Title : LLM System Monitor - Site Reliability Engineer (SRE).

Location : Bangalore, India (Hybrid - Onsite 3 Days / Week).

Type : Full-Time (Insight Global at Cisco).

Required Skills & Experience :

  • 3+ years of experience monitoring and responding to incidents in a globally deployed web application.
  • Strong experience with microservices architecture on Kubernetes.
  • Deep understanding of observability tools and operational metrics (Grafana, Prometheus, P99, etc.).
  • Familiarity with AWS services or any major cloud provider.
  • Excellent communication and customer service skills - must be able to clearly articulate status and updates to technical and non-technical stakeholders.
  • Ability to ramp up quickly, take ownership, and work independently in a fast-paced environment.

Key Responsibilities :

  • Monitor Grafana dashboards and observability tools to detect failures and performance issues.
  • Act as the primary SRE for incident response, initiating reports from automated alerts or joining active incident channels.
  • Serve as the main point of contact during incidents, delivering frequent updates to customers and incident commanders.
  • Interpret operational metrics such as quantiles (e.g. P99) and other Prometheus data to assess system health.
  • Track and manage permutations of a globally deployed microservices architecture running on Kubernetes.
  • Collaborate with engineering and support teams to resolve issues quickly and efficiently.
  • Maintain strong communication and customer service throughout incident lifecycles.
  • Utilize foundational knowledge of AWS or other cloud platforms to support infrastructure monitoring.
  • Ramp up quickly on existing systems and processes.

Why Join?

  • Work with cutting-edge LLM infrastructure at Cisco.
  • Full-time opportunity with Insight Global.
  • Hybrid flexibility - onsite in Bangalore 3 days / week.
  • Immediate interviews and onboarding.
  • Competitive compensation.

(ref : hirist.tech)
