Talent.com
Sr Systems Engineer Linux – AI Infrastructure
Sr Systems Engineer Linux – AI InfrastructureDC Tech Consulting • Saint Thomas Mount, Tamil Nadu, India
No longer accepting applications
Sr Systems Engineer Linux – AI Infrastructure

Sr Systems Engineer Linux – AI Infrastructure

DC Tech Consulting • Saint Thomas Mount, Tamil Nadu, India
30+ days ago
Job description

Position : Senior Linux Administrator – AI / ML Infrastructure

Location : Remote

Experience : 5+ Years

Type : Full-time

Role Overview

We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises Linux servers optimized for AI / ML workloads.

The ideal candidate will have deep expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center infrastructure components including servers, networking, storage, and virtualization technologies.

This role requires hands-on experience in automating infrastructure, optimizing performance, and ensuring reliability for high-performance computing (HPC) and AI / ML pipelines.

Key Responsibilities

Deploy, configure, and manage on-premises Linux servers supporting AI / ML workloads.

Set up, manage, and troubleshoot Kubernetes clusters for containerized workloads.

Optimize system and network performance for compute-intensive applications.

Automate provisioning and configuration using Ansible, Terraform, and scripting (Bash / Python).

Administer and monitor data center components such as servers, storage arrays, switches, and power systems.

Ensure system security, patch management, and compliance across environments.

Collaborate with DevOps, Data Science, and AI engineering teams to enable seamless integration with ML pipelines.

Plan and implement scalability strategies, maintaining uptime and redundancy.

Maintain comprehensive documentation of configurations, policies, and network diagrams.

Required Skills & Qualifications

7+ years of experience in Linux system administration (RHEL, Ubuntu, CentOS).

Proven hands-on experience with Kubernetes cluster management (setup, scaling, troubleshooting).

CKA (Certified Kubernetes Administrator) certification is mandatory.

Strong knowledge of data center components – servers, racks, networking switches, storage systems, and virtualization layers.

Experience with Ansible, Terraform, CI / CD pipelines, and infrastructure automation.

Proficiency in scripting languages (Bash, Python).

Understanding of performance tuning, system optimization, and fault diagnosis.

Excellent problem-solving, communication, and collaboration skills.

Preferred / Good to Have

Exposure to NVIDIA GPU management, CUDA environments, and AI / ML compute nodes.

Familiarity with HPC environments and distributed computing frameworks.

Experience managing monitoring systems (Prometheus, Grafana) and backup solutions.

Knowledge of DevOps practices, containerization, and hybrid cloud environments.

Create a job alert for this search

Ai Infrastructure Engineer • Saint Thomas Mount, Tamil Nadu, India

Related jobs
Sr Systems Engineering

Sr Systems Engineering

Confidential • Chennai, India
We're seeking a highly skilled and experienced.The ideal candidate will have over.This role requires deep expertise in.Pure Storage administration and design. Cohesity backup and recovery solutions....Show more
Last updated: 12 days ago • Promoted
AI Infrastructure Architect

AI Infrastructure Architect

The Adecco Group • Chennai, Tamil Nadu, India
Design and implement large-scale AI / ML infrastructure solutions using NVIDIA GPU clusters, SMCI server platforms, and high-performance computing architectures to support enterprise AI workloads.Lea...Show more
Last updated: 12 days ago • Promoted
Technical Lead - AWS IaaS Infra Engineer

Technical Lead - AWS IaaS Infra Engineer

HCLTech • Chennai, Tamil Nadu, India
AWS IaaS Infrastructure Engineer.This role ensures high availability, security, and scalability of cloud environments, supports automation and integration with CI / CD pipelines, and provides operati...Show more
Last updated: 11 days ago • Promoted
Sr AI Solutions Engineer

Sr AI Solutions Engineer

Centific Global Solutions, Inc. • India Office - Chennai
In this role, you will provide regular updates on solutions and benefits / feature scope to Leadership and stakeholders.You'll apply NVIDIA processing and interconnect products such as NeMo, Riva, RA...Show more
Last updated: 15 days ago
Senior AI Platform DevOps Engineer

Senior AI Platform DevOps Engineer

Cloudely, Inc • Chennai, IN
AI Platform DevOpes / SRE Engineer.Responsibilities / What You’ll Do.Platform Design and Architecture : building and operating a highly available, scalable, modular AI platform using technologies such ...Show more
Last updated: 2 days ago • Promoted
System Engineer II - SE 2

System Engineer II - SE 2

Straive • Chennai, IN
LearningMate / Straive and MGT Impact Solutions, LLC (MGT) have established a strategic global partnership designed to deliver world-class advisory, technology, and operational solutions for public s...Show more
Last updated: 11 days ago • Promoted
Unix System Administrator

Unix System Administrator

Tata Consultancy Services • Chennai, IN
TCS present an excellent opportunity for Unix (AIX, Linux) administrator.Interview date : 11-Dec-25 (Thursday).Strong Knowledge and experience in. Installing, Configuring and Troubleshooting.VIO serv...Show more
Last updated: 3 hours ago • Promoted • New!
Specialist Systems Engineer- Control M

Specialist Systems Engineer- Control M

Societe Generale Global Solution Centre • Chennai, Tamil Nadu, India
L2 / L3 level Control-M Administration and scheduling, Maintenance, Enhancement, Production / Application Support, and Infrastructure Administration. Must be Infrastructure Administrator of Control-M an...Show more
Last updated: 9 days ago • Promoted
Site Reliability Engineer

Site Reliability Engineer

Datum Technologies Group • Chennai, Tamil Nadu, India
Job Title : Site Reliability Engineer (SRE) – AWS.AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog.We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experi...Show more
Last updated: 26 days ago • Promoted
L3 / L4 Infra Support Engineer

L3 / L4 Infra Support Engineer

Consolidated Analytics • Chennai, Tamil Nadu, India
Job description : L3 / L4 Infra Support Engineer - Windows / Azure.Systems Administrator to join our Systems Team to help design, implement, maintain, and support our growing server infrastructure in th...Show more
Last updated: 30+ days ago • Promoted
Agentic & AI Tech Ops Engineer

Agentic & AI Tech Ops Engineer

Insight Global • Chennai, IN
Agentic & AI Tech Ops Engineer.Agentic & AI Tech Ops Engineer.AI and Agentic AI systems in production.You will manage deployments, monitor performance, troubleshoot issues, and implement best pract...Show more
Last updated: 12 days ago • Promoted
Senior Generative AI Engineer / AI Engineer

Senior Generative AI Engineer / AI Engineer

Unity Systems • Chennai, IN
Client is at the forefront of AI innovation, leveraging cutting-edge technology to transform legacy systems into modern, efficient, and scalable solutions. We work with enterprise clients to breathe...Show more
Last updated: 3 hours ago • Promoted • New!
Linux Systems Engineer

Linux Systems Engineer

Jio Platforms Limited (JPL) • Chennai, Tamil Nadu, India
Must have 3- 6 Years of experience in the field of Linux Administration.Mandatory Skills / Knowledge Redhat : - 1.Should have good experience of Linux Administration (OS Installation, Virtualization,...Show more
Last updated: 6 days ago • Promoted
Senior / Principal ASIC RTL Design Engineer (SoC / Subsystem)

Senior / Principal ASIC RTL Design Engineer (SoC / Subsystem)

Proxelera • Chennai, Tamil Nadu, India
My name is Shahid I am reaching out with a role that fits engineers who enjoy real ownership, from shaping micro-architecture to watching their RTL come alive in silicon. If you’re looking for a spa...Show more
Last updated: 20 days ago • Promoted
Sr. Site Reliability Engineer (SRE)

Sr. Site Reliability Engineer (SRE)

Datum Technologies Group • Chennai, Tamil Nadu, India
Site Reliability Engineer (SRE).Duration : Contract to Hire (On the Payroll of Datum Technology Group).Location : Chennai || Mumbai || Gurugram. Interview Process : Virtual (2 Rounds) + 1 Technical scr...Show more
Last updated: 10 days ago • Promoted
Linux System Administrator (AWS Specialist)

Linux System Administrator (AWS Specialist)

MGT-COMMERCE GmbH • Chennai, IN
MGT-Commerce GmbH specializes in helping Magento shops achieve optimal performance through Managed Cloud Hosting solutions powered by Amazon Web Services (AWS). Founded in 2010 and located in Berlin...Show more
Last updated: 30+ days ago • Promoted
Sr.Software Engineer

Sr.Software Engineer

Zupee • chennai, India
We're seeking 2 Senior Software Engineers who live and breathe AI to architect and build the technical foundation of our AI Experiences platform. You'll be instrumental in creating robust, scalable ...Show more
Last updated: 27 days ago • Promoted
Linux Engineer

Linux Engineer

CBTS • Chennai, Tamil Nadu, India
CBTS has an opening for a candidate with 7+ years’ experience managing Linux operating systems.Essential Technical Qualifications : . Linux (RedHat 6 / 7 / 8, CentOS) required; additional Unix (Solaris / AI...Show more
Last updated: 30+ days ago • Promoted