AIOps Engineer

T3 Strategic PartnersHyderabad

23 hours ago

Job description

Description : AI Ops Engineer - (ML Ops & LLM Ops).

Location : Hyderabad or Remote (within India).

Experience : 11- 14 Years.

IMMEDIATE JOINERS PREFERRED FROM IT SERVICES ORGANIZATION.

Role Overview :

We are looking for experienced AI Ops Engineers with deep expertise in MLOps, LLM deployment, and AI infrastructure management.

The ideal candidate will design and operate robust pipelines that support the full lifecycle of Large Language Models (LLMs) from training and fine-tuning to production deployment, monitoring, and optimization.

Key Responsibilities :

Design, implement, and maintain CI / CD pipelines for LLM training, fine-tuning, evaluation, and deployment.
Integrate tools like SonarQube and Checkmarx to enforce code quality and security standards.
Establish comprehensive versioning for models, datasets, prompts, and configurations to ensure traceability.
Package and deploy LLMs using Docker and Kubernetes for scalable, consistent runtime environments.
Set up end-to-end monitoring for model performance metricslatency, throughput, cost, and output quality (hallucination, coherence, safety).
Implement alerting mechanisms to detect anomalies, performance degradation, and model drift.
Manage and fine-tune cloud infrastructure (AWS, Azure) and GPU / TPU environments for optimal performance.
Use Terraform or CloudFormation for automated environment provisioning and configuration management.
Apply cost optimization strategies for LLM inference and serving while maintaining reliability.
Architect systems for high availability, fault tolerance, and resilience in AI workloads.
Diagnose and resolve infrastructure or model-related issues in production environments.
Contribute to frameworks ensuring model explainability, fairness, and traceability.
Automate data ingestion, retraining triggers, and pipeline orchestration using modern MLOps tools.
Build and manage complex LLM workflows through orchestration platforms for efficient end-to-end operations.
Continuously monitor and address model degradation, data drift, and other operational risks.

Required Skills & Experience :

10+ years in DevOps, ML Ops, or AI Infrastructure roles.

Strong hands-on experience with LLM deployment, MLOps frameworks, and cloud platforms (AWS, Azure).

Proficiency in Docker, Kubernetes, Terraform, and CI / CD tools (Jenkins, GitLab CI / CD, etc.)

Deep understanding of LLM lifecycle management, performance tuning, and observability.

Knowledge of security and compliance for AI systems.

Experience with GPU / TPU optimization and cost-efficient scaling.

Proven problem-solving and incident management abilities.

Strong communication and cross-functional collaboration skills.

Exposure to generative AI, prompt engineering, or RLHF pipelines.

Familiarity with LLM-specific monitoring tools and safety frameworks.

Open-source contributions in MLOps or AI Ops are a plus.

Certifications in Cloud (AWS / Azure) or DevOps practices preferred.

(ref : hirist.tech)

Create a job alert for this search

Engineer • Hyderabad

Related jobs

Promoted

AWS DevOps Lead

Cognida.aiHyderabad, Telangana, India

Drive revenue growth, increase profitability and improve operational efficiencies.Forever curious, always on the front lines of technological advancements. Applying our latest learnings, and tools t...Show moreLast updated: 13 days ago

Promoted

TechOps Engineer

Aquanowhyderabad, India

Aquanow is a trading and technology company powering the next generation of financial services.We’re at the forefront of the rapidly evolving digital asset space, empowering businesses to navigate ...Show moreLast updated: 7 days ago

Promoted

AWS Operations Engineer

ConfidentialHyderabad / Secunderabad, Telangana, Delhi

As an Operations Engineer, you are a proactive and detail-oriented professional with a strong foundation in cloud technologies, particularly AWS. You thrive in a collaborative environment and are co...Show moreLast updated: 12 days ago

Promoted
New!

MLOps Engineer

X4 TechnologyHyderabad, Telangana, India

MLOps Engineer - Role & Responsibilities Design, deploy and manage scalable & secure cloud infrastructure Apply least privilege across cloud platforms (Azure, RBAC, AWS IAM) Enable audit logging co...Show moreLast updated: 10 hours ago

Promoted

AWS DevOps Engineer - Cloud Infrastructure

DeqodeHyderabad

Job Title : Platform Engineer Experience Level : 5+ Years Location : Bangalore, Pune, Hyderabad, Chennai a...Show moreLast updated: 30+ days ago

Promoted

AAPMOR - DevOps Engineer - CI / CD Pipeline

AAPMORHyderabad

About the Role : We are seeking an experienced DevOps Engineer with strong expertise in Azure and AWS cloud platforms.The role involves designing, automating, and op...Show moreLast updated: 5 days ago

Promoted

DevOps Engineer

ESB TechnologiesHyderabad, IN

DevOps Engineer – Cloud Infrastructure (AWS / Azure, Terraform).A large-scale digital modernization program is underway to unify inventory, pricing, and e-commerce across. We’re looking for a Senior D...Show moreLast updated: 14 days ago

Promoted

AWS / DevOps Engineer - Cloud Infrastructure

FinJoHyderabad

We are seeking a hands-on AWS DevOps Engineer with 4+ years of experience in managing cloud infrastructure and integrating secure DevOps practices. This role demands high-speed execution in a fastpa...Show moreLast updated: 30+ days ago

Promoted
New!

AiOps Engineer

L&T Technology ServicesSecunderabad, Telangana, India

Only immediate to 15 days joiner Experience – 8 to 10 yrs.Key Responsibilities : Develop and Deploy AI Solutions : .Design, build, and deploy end-to-end Machine Learning and Generative AI pipelines on...Show moreLast updated: 10 hours ago

Promoted

DevOps Engineer

SID Global Solutionshyderabad, India

Job Description : DevOps Engineer.Design, implement, and maintain CI / CD pipelines to automate build, test, and deployment processes across multiple client projects. Develop and manage infrastructure ...Show moreLast updated: 7 days ago

Promoted

DevOps Engineer - AWS Services

Awign Enterprise Pvt ltdHyderabad

Title : DevOps Engineer Location : Hyderabad Experience : 5+ years What Youll...Show moreLast updated: 30+ days ago

Promoted

AWS DevOps Engineer

Charter GlobalHyderabad, Telangana, India

Job Summary : We are looking for a highly skilled AWS DevOps Engineer with hands-on expertise across both AWS and Microsoft Azure. You will be responsible for building secure, scalable, and auto...Show moreLast updated: 22 days ago

Promoted

AWS CDK

ACubetech Solutions Private limitedsecunderabad, India

We are seeking a highly skilled DevOps Engineer with 7+ years hands-on expertise in AWS CDK using TypeScript to help modernize infrastructure, ensure performance, and maintain security for cloud-ho...Show moreLast updated: 5 days ago

Promoted

AWS DevOps Engineer

ConfidentialHyderabad / Secunderabad, Telangana

Design, deploy, and manage scalable cloud infrastructure on AWS using Terraform.Develop automated deployment scripts using Python to ensure efficient provisioning of resources.Collaborate with cros...Show moreLast updated: 30+ days ago

Promoted

DevOps Manager

Unified InfotechHyderabad, IN

We are seeking a highly skilled and motivated.AWS and Azure cloud platforms to join our dynamic team.The successful candidate will collaborate with solution architects, developers, project managers...Show moreLast updated: 14 days ago

Promoted

Techdome - DevOps Engineer - AWS

TechdomeHyderabad

Key Responsibilities : - Manage and optimize AWS cloud infrastructure ensuring scalability, security, and reliability.Design, implement, a...Show moreLast updated: 30+ days ago

Promoted

AWS DevOps Engineer

Awign Enterprise Pvt ltdHyderabad

About the client : We are a world-changing team of AI researchers and engineers working on the cutting edge of generative AI. We are building systems that work across ...Show moreLast updated: 5 days ago

Promoted

Azure & AWS DevOps Engineer

ConfidentialHyderabad / Secunderabad, Telangana

New Relic monitoring, DocFx, and Azure DevOps processes.The ideal candidate will also possess strong skills in.Additionally, familiarity with application development knowledge on.Cloud Infrastructu...Show moreLast updated: 30+ days ago