Talent.com
Senior Infrastructure Engineer (MLOps / LLMOps)

Senior Infrastructure Engineer (MLOps / LLMOps)

QuantiphiBengaluru, Republic Of India, IN
30+ days ago
Job description

Role : Senior Platform Engineer (MLOps / LLMOps)

Experience : 3 to 6 Years

Location : Bangalore / Mumbai / Trivandrum (Hybrid)

Job Summary :

Join our dynamic team as a Platform Engineer and leverage your expertise in production-scale platforms within the GenAI or ML domain . In this role, you'll be instrumental in developing and maintaining cutting-edge build and test environments for critical GenAI workloads running on foundational cloud infrastructure.

You'll partner with architects to design and implement highly robust and scalable systems, while also providing crucial development support to SRE / Operations teams as they tackle complex distributed systems challenges at scale. We're seeking an engineer who champions Quantiphi's dedication to Cloud-Native development , with a particular emphasis on Kubernetes .

Job Responsibilities :

As a Platform Engineer , you will play a pivotal role in designing, implementing, and optimizing our cutting-edge infrastructure. Your responsibilities will include :

  • Implementing state-of-the-art GPU compute clusters to support critical workloads.
  • Developing comprehensive automated testing strategies and frameworks across unit, integration, API, and end-to-end levels for critical commerce flows.
  • Ability to create robust performance testing frameworks to validate platform scalability, resilience, and identify optimization opportunities.
  • Experience in developing comprehensive monitoring solutions with alerting systems to track platform health and ensure SLA compliance.
  • Building a scalable automation infrastructure that supports growing platform capabilities with consistent test environments.
  • Troubleshooting, diagnosing, and performing root cause analysis of system failures, isolating components and failure scenarios in collaboration with internal and external partners.
  • Optimizing cluster operations for maximum reliability, efficiency, and performance.

Job Requirements :

We are seeking a highly skilled and passionate Platform Engineer with :

  • Over 3 years of hands-on experience in large-scale direct experience building and deploying production-ready services on Kubernetes.
  • A proven history of engaging with and contributing to open-source projects .
  • A collaborative spirit , demonstrated by prior work developing scalable software solutions for cloud services.
  • The ability to effectively communicate complex technical designs and decisions with internal team.
  • An understanding of GPU computing and AI infrastructure .
  • A strong passion for solving complex technical challenges and optimizing system performance.
  • Working knowledge of cluster configuration management tools such as BCM or Ansible, and infrastructure-level applications including Kubernetes, Terraform, and MySQL.
  • In-depth understanding of container technologies like Docker and Containers.
  • Proficiency in programming with Python and Bash scripting.
  • Ways To Stand Out From The Crowd :

    Candidates who possess the following will be highly competitive :

  • Significant experience with sophisticated infrastructure tooling , including Kubernetes Cluster API, Terraform, Helm, and Operator Framework.
  • Practical, production-level experience across major cloud platforms : Azure Cloud, Google Cloud Platform (GCP), or Amazon Web Services (AWS).
  • A strong track record of successfully refactoring and optimizing software for deployment within Kubernetes environments .
  • Understanding of the CNCF landscape and its associated tooling.
  • The ability to decompose complex problems into simpler sub-problems and leverage existing solutions for efficient implementation, along with designing simple, self-sustaining systems.
  • Experience leveraging AI / ML to proactively detect and resolve incidents , automate alert triaging, perform log analysis, and streamline repetitive workflows.
  • Create a job alert for this search

    Senior Infrastructure Engineer • Bengaluru, Republic Of India, IN

    Related jobs
    • Promoted
    • New!
    Senior Cloud Infrastructure Engineer (EKS)

    Senior Cloud Infrastructure Engineer (EKS)

    FICOBengaluru, Republic Of India, IN
    FICO is seeking a senior AWS Cloud Engineer who thrives working in a fast paced state of the art Cloud environment.This position will be heavily involved with the migration of our existing products...Show moreLast updated: 10 hours ago
    • Promoted
    Senior DevOps Engineer - Cloud Infrastructure

    Senior DevOps Engineer - Cloud Infrastructure

    EduRunBangalore
    Job Description : Key Responsibilities : - Design, im...Show moreLast updated: 27 days ago
    • Promoted
    Cloud Infrastructure Support Engineer

    Cloud Infrastructure Support Engineer

    WhiteLotus Talent PartnersBengaluru, Republic Of India, IN
    L0 and L1 Site Reliability Engineer (SRE) Support.Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by. In this role, you will focu...Show moreLast updated: 30+ days ago
    • Promoted
    Senior DevOps Engineer

    Senior DevOps Engineer

    RadwareGreater Bengaluru Area, India
    Radware is a global leader of cyber security and application delivery solutions for physical, cloud, and software defined data centers. At Radware, we live and breathe cybersecurity.Each day, our in...Show moreLast updated: 16 days ago
    • Promoted
    AspenTech - Senior DevOps Engineer - System Infrastructure

    AspenTech - Senior DevOps Engineer - System Infrastructure

    Aspen TechnologyBangalore
    The Role : As a Senior DevOps Engineer, you will play a key role in designing, deploying, and supporting DGMs control systems infrastructure.Youll...Show moreLast updated: 30+ days ago
    • Promoted
    DevOps Engineer - Cloud Infrastructure

    DevOps Engineer - Cloud Infrastructure

    MagnifireBangalore
    Description : We are seeking a DevOps Engineer who is passionate about automation, scalability, security, and reliabili...Show moreLast updated: 21 days ago
    • Promoted
    Senior Infrastructure Engineer

    Senior Infrastructure Engineer

    Lowe's IndiaBengaluru, Karnataka, India
    Lowe’s is a FORTUNE® 100 home improvement company serving approximately 16 million customer transactions a week in the United States. With total fiscal year 2024 sales of more than $83 billion, Lowe...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Platform Engineer

    Senior Platform Engineer

    QuantiphiBengaluru, Karnataka, India
    Senior Platform Engineer (MLOps / LLMOps).Bangalore / Mumbai / Trivandrum (Hybrid).In this role, you'll be instrumental in developing and maintaining cutting-edge build and test environments for cr...Show moreLast updated: 30+ days ago
    • Promoted
    Bottomline - Systems Engineer II - Cloud Infrastructure

    Bottomline - Systems Engineer II - Cloud Infrastructure

    bottomlineBangalore
    How You Will Contribute : - Developing, enhancing, and maintaining our core services within our private and public clouds. Building your knowledge and understanding of...Show moreLast updated: 30+ days ago
    • Promoted
    Five9 - Cloud Infrastructure Engineer - DNS Architecture

    Five9 - Cloud Infrastructure Engineer - DNS Architecture

    Five9Bangalore
    Join us in bringing joy to customer experience.Five9 is a leading provider of cloud contact center software, bringing the power of cloud innovation to customers worldwide.Living our values everyday...Show moreLast updated: 30+ days ago
    • Promoted
    Senior AI Infrastructure Engineer

    Senior AI Infrastructure Engineer

    IdeaSouqBengaluru, Karnataka, India
    At IdeaSouq, we are building the.Traditional investment workflows are drowning in data silos, manual screening, and overwhelming deal flow. We're a startup building the solution : an AI analyst that ...Show moreLast updated: 4 days ago
    • Promoted
    Miratech - Senior Cloud Infrastructure Engineer - DevOps

    Miratech - Senior Cloud Infrastructure Engineer - DevOps

    MiratechBangalore
    Job Description : About the Role : We are seeking a highly experienced Cloud Infrastructure Engineer to implement and support ...Show moreLast updated: 24 days ago
    • Promoted
    MLops Engineer

    MLops Engineer

    RecroBangalore, IN
    We are looking for an experienced.Azure and AWS cloud ecosystems.The ideal candidate should bring a strong background in. GenAI tooling, automation, and CI / CD pipelines.Design, implement, and manage...Show moreLast updated: 16 days ago
    • Promoted
    DevOps Engineer

    DevOps Engineer

    Alp Consulting Ltd.Greater Bengaluru Area, India
    Good knowledge of AWS technologies including EC2, ECS / EKS (Docker containers), RDS, S3, Lambda, CloudHSM.Cloud stack deployment & upgrade using CloudFormation / Terraform.REST end point development...Show moreLast updated: 14 days ago
    • Promoted
    MLOps Engineer - Cloud Infrastructure

    MLOps Engineer - Cloud Infrastructure

    ImpacteersBangalore
    Role Overview : As an MLOps Engineer, you will be responsible for bridging the gap between machine learning development and production deployment.You will work closel...Show moreLast updated: 30+ days ago
    • Promoted
    IP Infrastructure Engineer

    IP Infrastructure Engineer

    InfogainBengaluru, Republic Of India, IN
    Excellent communication and collaboration skills.Interested candidates can share resume on email id- arti.Show moreLast updated: 16 days ago
    • Promoted
    Senior Cloud Infrastructure Engineer

    Senior Cloud Infrastructure Engineer

    Albertsons Companies IndiaBengaluru, Republic Of India, IN
    About Albertsons Companies Inc.As a leading food and drug retailer in the United States, Albertsons Companies, Inc.Our well-known banners across the United States, including Albertsons, Safeway, Vo...Show moreLast updated: 30+ days ago
    • Promoted
    Senior DevOps Engineer - Cloud Infrastructure

    Senior DevOps Engineer - Cloud Infrastructure

    MNR SolutionsBangalore
    Description : Key Responsibilities : 1.Cloud Infrastructure & Automation - Design, provision, and maintain scalable GCP ...Show moreLast updated: 3 days ago