Talent.com
ML Infrastructure Architect
Quantiphi • Bengaluru, Republic Of India, IN
30+ days ago
Job description

Role: Associate Architect - MLOps / LLMOps

Experience: 6 to 8 Years

Location: Bangalore / Mumbai (Hybrid)

Job Summary:

Join our dynamic team as a Platform Architect and leverage your expertise in production-scale platforms within the GenAI or ML domain. In this role, you'll be instrumental in designing, developing, and maintaining cutting-edge build and test environments for critical GenAI workloads running on foundational cloud infrastructure.

You'll partner with architects to design and implement highly robust and scalable systems, while also providing crucial development support to SRE / Operations teams as they tackle complex distributed systems challenges at scale. We're seeking an engineer who champions Quantiphi's dedication to Cloud-Native development, with a particular emphasis on Kubernetes.

Job Responsibilities:

As a Platform Architect, you will play a pivotal role in designing, implementing, and optimizing our cutting-edge infrastructure. Your responsibilities will include:

  • Designing and implementing state-of-the-art GPU compute clusters to support critical workloads.
  • Designing comprehensive automated testing strategies and frameworks across unit, integration, API, and end-to-end levels for critical commerce flows.
  • Developing robust performance testing frameworks to validate platform scalability, resilience, and identify optimization opportunities.
  • Planning comprehensive monitoring solutions with alerting systems to track platform health and ensure SLA compliance.
  • Designing specialized test frameworks for security controls and ensuring compliance validation across payment and personal data.
  • Architecting a scalable automation infrastructure that supports growing platform capabilities with consistent test environments.
  • Troubleshooting, diagnosing, and performing root cause analysis of system failures, isolating components and failure scenarios in collaboration with internal and external partners.
  • Optimizing cluster operations for maximum reliability, efficiency, and performance.

Job Requirements:

We are seeking a highly skilled and passionate Platform Engineer with:

  • 6 to 8 years of experience developing ML infrastructure.
  • Over 3 years of direct, hands-on experience building and deploying large-scale, production-ready services on Kubernetes.
  • A proven history of engaging with and contributing to open-source projects.
  • A collaborative spirit, demonstrated by prior work developing scalable software solutions for cloud services.
  • The ability to effectively communicate complex technical designs and quality approaches across various mediums.
  • A deep understanding of GPU computing and AI infrastructure.
  • A strong passion for solving complex technical challenges and optimizing system performance.
  • Working knowledge of cluster configuration management tools such as BCM or Ansible, and infrastructure-level applications including Kubernetes, Terraform, and MySQL.
  • In-depth understanding of containers and container technologies such as Docker.
  • Proficiency in programming with Python and Bash scripting.

Ways To Stand Out From The Crowd:

Candidates who possess the following will be highly competitive:

  • Significant experience with sophisticated infrastructure tooling, including Kubernetes Cluster API, Terraform, Helm, and Operator Framework.
  • Practical, production-level experience across major cloud platforms: Azure, Google Cloud Platform (GCP), or Amazon Web Services (AWS).
  • Ability to adapt to new technologies and frameworks in the ML / GenAI landscape.
  • A strong track record of successfully refactoring and optimising software for deployment within Kubernetes environments.
  • Comfort discussing and working with core Kubernetes concepts like CSI, CNI, and CRI.
  • Comprehensive understanding of the CNCF landscape and its associated tooling.
  • The ability to decompose complex problems into simpler sub-problems and leverage existing solutions for efficient implementation, along with designing simple, self-sustaining systems.
  • Experience leveraging AI / ML to proactively detect and resolve incidents, automate alert triaging, perform log analysis, and streamline repetitive workflows.