Talent.com
This job offer is not available in your country.
Observability - Engineer Site Reliability [T500-20244]

Observability - Engineer Site Reliability [T500-20244]

Albertsons Companies IndiaBengaluru, Karnataka, India
10 days ago
Job description

About Albertsons Companies Inc. (ACI) :

As a leading food and drug retailer in the United States, Albertsons Companies, Inc. operates over 2,200 stores across 35 states and the District of Columbia. Our well-known banners across the United States, including Albertsons, Safeway, Vons, Jewel-Osco and others, serve more than 36 million U.S customers each week.

We build and shape technology solutions that solve customers’ problems every day, making things easier for them when they shop with us online or in a store. We have made bold, strategic moves to migrate and modernize our core foundational capabilities, positioning ourselves as the first fully cloud-based grocery tech company in the industry.

Our success is built on a one-team approach, driven by the desire to understand and enhance the customer experience. By constantly pushing the boundaries of retail, we are transforming shopping into an experience that is easy, efficient, fun and engaging.

About Albertsons India Capability Center :

At Albertsons India Capability Center, we're not just pushing the boundaries of technology and retail innovation, we're cultivating a space where ideas flourish and careers thrive. Our workplace in India is a vital extension of the Albertsons Companies Inc. workforce and important to the next phase in the company’s technology journey to support millions of customers’ lives every day.

At the Albertsons India Capability Center, we are raising the bar to grow across Technology & Engineering, AI, Digital and other company functions, and transform a 165-year-old American retailer. At Albertsons India Capability Center, associates collaborate directly with international teams, enhancing decision-making processes and organizational agility through exciting and pivotal projects. Your work will make history and help millions of lives each day come together around the joys of food and inspire their well-being.

What you will be doing :

This role will be an individual contributor responsible for building and finetuning the platform components for the Observability product. The candidate will work closely with the Lead engineer, performance team, data ingestion, platform DevOps and data visualization teams under Observability product. As a member of the platform team, the candidate needs to be able to support and maintain the applications onboarded to Grafana Observability, Ingestion and visualization written in PromQL, Log queries, etc., and monitoring technologies.

This position will preferably be based out of India GCC, Bangalore.

Key Responsibilities :

  • Experience in Observability and Monitoring initiatives as platform Engineer.
  • Troubleshoot platform issues and restore service by resolving customer-facing incidents
  • Development and implementation of build release pipelines with accountability for managing deployment schedules, issues, risks, and impediments.
  • Agile development experience with team member accountability for commitment and delivery each sprint.
  • Troubleshoot and implement corrections to problems associated with connectivity between the supported applications and the clients they serve
  • Provide technical guidance, in the diagnosis of issues as they arise in support of critical applications
  • Drive collaboration sessions among IT and business groups to facilitate optimal support and operation of the relevant applications
  • Provide Site Reliability Engineering techniques such as observability, alerting and performance tuning
  • Contribute to the design, implementation, and enhancement of critical applications
  • Perform proactive analysis and troubleshooting to predict and prevent production incidents
  • Define and contribute to monitoring capabilities for critical applications
  • Collaborate with key vendors on functional, performance and capacity improvements
  • Design and build tools to automate support and monitoring functions
  • Ensure that all implementations of observability meet the requirements prescribed by IT Services through the effective implementation or use of approved processes, methodologies, and deliverables.
  • Provide expertise and build solutions for observability applications as well as system integration with internal systems and external vendors.
  • Able to provide coding and technical direction to less experienced staff or develops highly complex original code.
  • Track infrastructure delivery and dependencies to implementation.

We are searching for someone with the following skills :

  • Experience with gathering and organizing large volume of data to use for instrumentation into an Enterprise Observability solution.
  • Experience with recommending baseline monitoring thresholds, and performance monitoring KPIs and SLAs.
  • Experience with installing agents, forwarders, APIs, performance monitoring alerts, dashboards, and data trend analysis.
  • Good Knowledge and understanding of Azure foundation components e.g. App GW, APIM, Virtual Network, NSG, Load Balancer, Azure VM etc. is required.
  • Experience with Databases Azure SQL, PostgreSQL, MySQL, MongoDB, TSDB or similar databases.
  • Knowledge of monitoring tools such as Log Analaytics, App Dynamics, Grafana, Prometheus, Splunk, and Sitescope
  • Azure / GCP hands-on with details around pulling observability data from managed services
  • Golang / Python coding or from solutioning background with experience on SRE development and Open telemetry implementation
  • Deploying / managing and optimizing enterprise level observability platform for Grafana OSS products like Mimir,Loki,Tempo, Fluentbit / Vector
  • Design and develop standard Grafana dashboards for critical metrics for various Azure / GCP services using the observability data
  • Experience must include at least one of the following languages : Java (required), Desired Python, GoLang, node.js
  • Experience in working with ServiceNow or similar Service Management tools
  • Familiarity with Cloud technologies in Azure, AWS, and Google Cloud
  • Experience on PCF, Docker, Kubernetes platform is required.
  • Experience with DevOps and CI / CD tools and processes is required.
  • Experience in high-performance and high-frequency data streaming (using Kafka etc.) and handling large volume of batch data is strongly preferred but not required.
  • Experience with Agile / Scrum methodologies is required.
  • We believe the successful candidate has these qualifications and experience :

  • 4-year degree (Computer Science, Information Systems, or relational functional field) and / or equivalent combination of education or work experience.
  • 1-3+ years of experience on integration engineering related to Observability / Monitoring framework with open source technologies such as Grafana, Mimir, Loki, Tempo, Fluentbit, Vector etc.,
  • Hands-on experience with Tools and Technology is preferred.
  • 1+ years of experience as a System Reliability Engineer is required.
  • Experience working with Open-source platforms and Open Telemetry libraries e.g. Grafana is preferred.
  • Create a job alert for this search

    Site Reliability Engineer • Bengaluru, Karnataka, India

    Related jobs
    • Promoted
    • New!
    Lead Site Reliability Engineer

    Lead Site Reliability Engineer

    SITABengaluru, Karnataka, India
    At SITA, we keep airports moving, airlines flying smoothly, and borders open.Our technology and communication innovations power the success of the global air travel industry.Youll find us in 95% of...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Swiss ReBengaluru, Karnataka, India
    You would be playing a key role in ensuring the reliability, stability, scalability and security of our Logging & Monitoring cloud systems and infrastructure. You will be designing, implementing, an...Show moreLast updated: 5 hours ago
    • Promoted
    Sr. Site Reliability Engineer [T500-20179]

    Sr. Site Reliability Engineer [T500-20179]

    Delta Air LinesBengaluru, Karnataka, India
    Delta Air Lines (NYSE : DAL) is the U.Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-...Show moreLast updated: 20 days ago
    • Promoted
    • New!
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    Procore TechnologiesBengaluru, Karnataka, India
    Senior Site Reliability Engineer.Procore’s Product & Technology Team.Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are t...Show moreLast updated: 5 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftBengaluru, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Thomson ReutersBrookefield, Karnataka, India
    Are you passionate about the chance to bring your experience to a world-class company that is market-leading for both content and technology? If yes, we are looking for you!.We are looking for a Si...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    OracleBengaluru, Karnataka, India
    Looking for a DevOps Senior Engineer in the Data Engineering team who can help us support next-generation Analytics applications over Oracle cloud. This posting is for DevOps Senior Engineer in the ...Show moreLast updated: 5 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions in...Show moreLast updated: 20 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TavantBengaluru, Karnataka, India
    With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 28 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Neptune Retail SolutionsBengaluru, Karnataka, India
    Quotient a subsidiary of Neptune Retail Solutions is the leading digital media and promotions technology company that creates cohesive omnichannel brand-building and sales-driving opportunities to ...Show moreLast updated: 5 hours ago
    • Promoted
    Staff Site Reliability Engineer (Observability)

    Staff Site Reliability Engineer (Observability)

    Palo Alto NetworksBengaluru, Karnataka, India
    Our Mission At Palo Alto Networks® everything starts and ends with our mission : Being the cybersecurity partner of choice, protecting our digital way of life. Our vision is a world where each day ...Show moreLast updated: 8 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne SolutionsBengaluru, Karnataka, India
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    Aqilea (formerly Soltia)Bengaluru, Karnataka, India
    We are a consulting company with a bunch of technology-interested and happy people!.We love technology, we love design and we love quality. Our diversity makes us unique and creates an inclusive and...Show moreLast updated: 5 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaBengaluru, Karnataka, India
    Performance & Reliability Engineer ( Senior, Lead , Principal & Manager).Location : Pune, Chennai, Bangalore & Gurgaon.Role : Performance & Reliability Engineer. Job Location : Gurgaon, Chennai, Pune, ...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersBengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    AIONBengaluru, Karnataka, India
    AION is building the next generation of AI cloud platform by transforming the future of high-performance computing (HPC) through its decentralized AI cloud. Purpose-built for bare-metal performance,...Show moreLast updated: 5 hours ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    WissenBengaluru, Karnataka, India
    Wissen Technology is Hiring for Site Reliability Engineer.About Wissen Technology : Wissen Technology is a globally recognized organization known for building solid technology teams, working with ma...Show moreLast updated: 5 hours ago