Talent.com
This job offer is not available in your country.
Site Reliability / DevOps Engineer

Site Reliability / DevOps Engineer

Loyalytics AIBangalore
30+ days ago
Job description

About Loyalytics :

Loyalytics is a fast-growing Analytics consulting and product organization based out of Bangalore. We work with large retail clients across the globe helping them monetize their data assets through our consulting assignments and product accelerators.

We are a young dynamic team of 100+ analytics practitioners working on some of the most cutting-edge tools and We Are :

  • Technical team : A team full of data scientists, data engineers and business analysts who work with 1M+ data points every day.
  • Market Size : Massive multi-billion $ global market opportunity.
  • Leadership : Combined experience of 40+ years of experience in the industry.
  • Customers : Word-of-mouth and referral driven marketing to acquire customers like big retail brands in GCC regions like Lulu, GMG, among others (Strong product-market fit).
  • What makes us stand apart : 8 years old bootstrapped and 100+ people company that is still The Job :

We're looking for a hands-on Site Reliability / DevOps Engineer to be our first hire in this function, responsible for owning and scaling the reliability, observability, and infrastructure of our platform running entirely on Microsoft Azure.

You'll be critical in shaping DevOps culture, architecting fault-tolerant systems, and deploying automation to improve uptime, performance, and cost efficiency.

This is a hybrid role combining SRE and DevOps principles - ideal for builders comfortable working in fast-paced, product-driven You'll Own :

Cloud Infrastructure (Microsoft Azure Must Have) :

  • Architect, deploy, and maintain services across Azure App Services, Azure Container Apps, Cosmos DB, Event Hubs, Azure Monitor, Azure VMs, and Azure Kubernetes Service (AKS).
  • Design and manage networking (VNets, Subnets, NSGs) and identity / access controls (PIM, Managed Identities, Enterprise Applications, Role-based Access Control).
  • Own infrastructure provisioning using Terraform / Bicep.
  • Implement cost-effective, scalable, and secure cloud environments across development, staging, and production.
  • Monitoring, Observability & Incident Response :

  • Set up end-to-end observability using Prometheus, Grafana, Azure Monitor, ELK Stack, and Sentry.
  • Define and enforce standards for logging, metrics, traces, SLIs / SLOs, and error budgets.
  • Build proactive alerting systems for APIs, RabbitMQ, Databricks pipelines, and external integrations.
  • Establish on-call rotations, incident response runbooks, and lead RCAs to minimize MTTR.
  • CI / CD, Automation & Tooling :

  • Automate deployments and infrastructure lifecycle using GitHub Actions, Terraform modules, and CLI tools.
  • Improve CI / CD for faster, safer releases across containerized and VM-based workloads.
  • Build internal tools for diagnostics, rollback safety, and release automation.
  • Integrate resilience patterns : retries, circuit breakers, backoff strategies, failovers.
  • DevOps & System Reliability :

  • Optimize system performance, memory usage, and availability for core services like RabbitMQ, APIs, analytics pipelines on Databricks.
  • Implement zero-downtime deployments, self-healing systems, and infrastructure audits.
  • Perform regular cost analysis, right-sizing, and tag-based budget enforcement.
  • Security & Compliance Collaboration :

  • Work with security teams to maintain infrastructure and data flow diagrams, support ISO 27001, GDPR, PDPA readiness.
  • Participate in threat modeling, define trust boundaries, and implement audit-ready infrastructure Stack You'll Work With :
  • Cloud : Microsoft Azure (App Services, Container Apps, AKS, Cosmos DB, Event Hubs, Monitor, VMs).
  • IaC : Terraform, Bicep.
  • CI / CD : Azure Devops,GitHub Actions.
  • Monitoring & Logs : Prometheus, Grafana, Azure Monitor, ELK, Sentry.
  • Queueing : RabbitMQ, Kafka.
  • Languages : Node.js, Python (mostly for Culture :
  • The culture of Loyalytics plays the center stage of how we interact, act and respond to people and situations. We are guided by the principles of FACET in our day-to-day interactions and performance. We have a well-defined culture which clearly spells out the acceptable and unacceptable behaviors at Loyalytics.

    FACERT is a combination of specific values where

    F stands for fulfilling our promises :

    We believe in delivering on our commitments within the agreed-upon timeframe. This includes following through on commitments, communicating progress while being flexible to adapt to changes or any unexpected circumstances that may arise and still strive to meet our commitments.

    A stand for actionable recommendations :

    We believe in creating real impact by providing recommendations that are creative, forward-thinking, or outside-the-box and that have the potential to make a significant positive impact.

    Providing recommendations that are easy to understand and implement and ensuring follow up and support is our way of ensuring that the recommendations provided are implemented effectively.

    C stands for customer empathy :

    We are always looking for ways to make our customers experience great and believe in providing the highest quality service possible. We make satisfied customers through our regular interactions, active listening, promptness in response to inquiries and concerns and are flexible to adapt to the customers' unique needs and preferences.

    Our culture encourages the approach to anticipate our customer needs and take the necessary steps to address them before they become issues. We also practice a proactive exercise of checking in on the customer to ensure their needs are met and enquire if there are any additional needs or concerns. Our definition of customer includes our external as well as internal customers, based on our roles and responsibilities.

    E stands for extra mile :

    The culture at Loyalytics encourages you to take initiative and go above. and beyond what is expected of you. We encourage employees to constantly look for ways to improve their skills, knowledge, and performance. Being adaptable and open to learning helps us through changing circumstances and apply our learning to the work we do.

    R stands for Respect.

    We believe that every employee at Loyalytics is equally important and equally respected. We practice respect in our communication, actions, feedback and behaviors, exercise empathy and ensure to understand the other person's point of view and objectively state our point of view. We believe in transparency in our processes and procedures and abide by the principles of professionalism.

    T stands for teamwork :

    We believe in open communication where everyone is encouraged to reach out to anyone within the organization, share information and ideas, and listen actively to the viewpoints of others. Our aim is to reach the organization's goals through effective teamwork, support and collaboration.

    We have a culture of respect for all regardless of background and experience. If these values resonate, then Loyalytics is the place for you! Come join us and be part of the dynamic team racing towards being recognized as the most preferred retail organization in the world!

    (ref : hirist.tech)

    Create a job alert for this search

    Site Engineer • Bangalore

    Related jobs
    • Promoted
    Site Reliability Engineer Engineer - DevOps

    Site Reliability Engineer Engineer - DevOps

    Zealant Consulting GroupBangalore
    Job Summary : We are seeking a seasoned Site Reliability Engineer (SRE) Engineer to join our growing team.This is a crit...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    Bachelor's degree in Computer Science, Engineering, or a related field.Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions in...Show moreLast updated: 16 days ago
    • Promoted
    Senior Site Reliability Engineer- ELK Expert

    Senior Site Reliability Engineer- ELK Expert

    iVedha Inc.hosur, tamil nadu, in
    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice.Must be available to work in the EST (US / Canada) Time Zone. Are you a Senior Site Reliability Engineer (SRE) with ...Show moreLast updated: 30+ days ago
    • Promoted
    ThoughtSpot - Site Reliability Engineer - DevOps

    ThoughtSpot - Site Reliability Engineer - DevOps

    THOUGHTSPOT INDIA PRIVATE LIMITEDBangalore
    About The Role ThoughtSpot is an AI-powered analytics platform that enables users to explore and analyze data through natural language queries, making insights access...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Central Business Solutions Inc.Bengaluru, India
    Linux SRE (Linux SRE L3 with Infra + Operation Support).The Server Operations team is part of the Enterprise Computing organization within Client. The wider team has presence in cities globally and ...Show moreLast updated: 3 days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    BayOne SolutionsBengaluru, Karnataka, India
    Role : Site Reliability Engineer.The CXE Site Reliability Engineering (SRE) team manages the CI / CD pipelines and cloud infrastructure, ensuring seamless deployment, monitoring, and maintenance.Howev...Show moreLast updated: 5 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    TavantBengaluru, Karnataka, India
    With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tec...Show moreLast updated: 26 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    WSO2hosur, tamil nadu, in
    Founded in 2005, WSO2 is the largest independent software vendor providing open-source API management, integration, and identity and access management (IAM) to thousands of enterprises in over 90 c...Show moreLast updated: 7 days ago
    • Promoted
    DevOps / Platform Engineer

    DevOps / Platform Engineer

    iVedha Inc.hosur, tamil nadu, in
    Hiring a seasoned DevOps / Platform Engineer to drive automation, platform reliability, and robust.Design, deploy, and manage CI / CD pipelines and infrastructure automation, leveraging AI for.Implemen...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    XebiaBengaluru, Karnataka, India
    AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE).The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI / CD, monit...Show moreLast updated: 30+ days ago
    • Promoted
    • New!
    Site Reliability Engineer

    Site Reliability Engineer

    ExasoftBangalore, IN
    Responsibilities and Requirements : .Experience must be at least 10+ years in SRE.Multi Cloud, Hybrid Cloud – on Data center sites. Experience with multiple operating systems (.Operating Systems, Kern...Show moreLast updated: 5 hours ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    WhiteLotus Talent PartnersBengaluru, Karnataka, India
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    People Realm Recruitment Services Private LimitedBengaluru, Karnataka, India
    Job Title- Site Reliability Engineer.Desired Years of Experience - 5 - 14 Years of Relevant Experience.A Career with a Leading Global Investment Management Firm’s Technology Team.Our client, a lead...Show moreLast updated: 20 days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Uplershosur, tamil nadu, in
    Uplers is hiring for one of the clients.SRE (Oracle Cloud Infrastructure).Remote | Mon–Fri | 10 : 30 AM – 7 : 30 PM IST.Use of personal device required. OCI cloud infrastructure using Terraform and GitL...Show moreLast updated: 24 days ago
    • Promoted
    Site Reliability Engineer - DevOps

    Site Reliability Engineer - DevOps

    Whitefield CareersBangalore
    Key Responsibilities : - Troubleshoot complex issues in Linux environments and conduct application-level debugging.Manage and provision infrastructure using Terraform...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer

    Site Reliability Engineer

    Amicon Hub ServicesBengaluru, Karnataka, India
    Manage and scale production systems hosted on.Automate operational tasks using.Improve system reliability and reduce manual interventions through automation. Collaborate with development teams to en...Show moreLast updated: 6 days ago
    • Promoted
    Senior Site Reliability Engineer

    Senior Site Reliability Engineer

    ViewSonicBengaluru, Karnataka, India
    At ViewSonic Technologies, we’re passionate about building software that solves problems.We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availabilit...Show moreLast updated: 30+ days ago
    • Promoted
    Site Reliability Engineer - Chaos Management

    Site Reliability Engineer - Chaos Management

    Xebiahosur, tamil nadu, in
    AWS Engineer with strong Python development and Chaos Engineering expertise.The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault toler...Show moreLast updated: 7 days ago