Talent.com
No longer accepting applications
Sr Systems Engineer Linux – AI Infrastructure

Sr Systems Engineer Linux – AI Infrastructure

DC Tech ConsultingDelhi, Delhi, India
30+ days ago
Job description

Position : Senior Linux Administrator – AI / ML Infrastructure

Location : Remote

Experience : 5+ Years

Type : Full-time

Role Overview

We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises Linux servers optimized for AI / ML workloads.

The ideal candidate will have deep expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center infrastructure components including servers, networking, storage, and virtualization technologies.

This role requires hands-on experience in automating infrastructure, optimizing performance, and ensuring reliability for high-performance computing (HPC) and AI / ML pipelines.

Key Responsibilities

Deploy, configure, and manage on-premises Linux servers supporting AI / ML workloads.

Set up, manage, and troubleshoot Kubernetes clusters for containerized workloads.

Optimize system and network performance for compute-intensive applications.

Automate provisioning and configuration using Ansible, Terraform, and scripting (Bash / Python).

Administer and monitor data center components such as servers, storage arrays, switches, and power systems.

Ensure system security, patch management, and compliance across environments.

Collaborate with DevOps, Data Science, and AI engineering teams to enable seamless integration with ML pipelines.

Plan and implement scalability strategies, maintaining uptime and redundancy.

Maintain comprehensive documentation of configurations, policies, and network diagrams.

Required Skills & Qualifications

7+ years of experience in Linux system administration (RHEL, Ubuntu, CentOS).

Proven hands-on experience with Kubernetes cluster management (setup, scaling, troubleshooting).

CKA (Certified Kubernetes Administrator) certification is mandatory.

Strong knowledge of data center components – servers, racks, networking switches, storage systems, and virtualization layers.

Experience with Ansible, Terraform, CI / CD pipelines, and infrastructure automation.

Proficiency in scripting languages (Bash, Python).

Understanding of performance tuning, system optimization, and fault diagnosis.

Excellent problem-solving, communication, and collaboration skills.

Preferred / Good to Have

Exposure to NVIDIA GPU management, CUDA environments, and AI / ML compute nodes.

Familiarity with HPC environments and distributed computing frameworks.

Experience managing monitoring systems (Prometheus, Grafana) and backup solutions.

Knowledge of DevOps practices, containerization, and hybrid cloud environments.

Create a job alert for this search

Ai Infrastructure Engineer • Delhi, Delhi, India

Related jobs
  • Promoted
Systems Storage Engineer

Systems Storage Engineer

Tata Consultancy ServicesDelhi, Republic Of India, IN
Should be skilled in more than 3 technologies as mentioned below.Experience handling linux systems, preferably certified Red Hat Linux Admin. Experience working on Pure Storage and Powerflex (ScaleI...Show moreLast updated: 16 days ago
  • Promoted
IT- Sr Systems Engineer

IT- Sr Systems Engineer

ConfidentialNoida
We are seeking a highly skilled Generative AI Developer with 7 to 10 years of experience to join our team.The ideal candidate will have extensive experience in working with large language models (L...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Systems Engineer

Sr. Systems Engineer

ConfidentialNoida
Plan, administer, troubleshoot, and assist in the design of high availability systems and services in production and lab / test environments. .Analyze and resolve complex problems including interopera...Show moreLast updated: 30+ days ago
  • Promoted
Sr. Solutions Eng

Sr. Solutions Eng

Tata Consultancy ServicesDelhi, India
Greetings from Tata Consultancy Services!!!.Required Experience : 10+ years Location : .Atlassian, Jira, Confluence Key Responsibilities Platform Configuration & Administration Configure and manage Ji...Show moreLast updated: 7 days ago
  • Promoted
Sr Systems Engineer Linux – AI Infrastructure

Sr Systems Engineer Linux – AI Infrastructure

DC Tech ConsultingDelhi, India
Position : Senior Linux Administrator – AI / ML Infrastructure.We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises...Show moreLast updated: 30+ days ago
  • Promoted
Sr Ai Engineer

Sr Ai Engineer

Litmus7Delhi, Republic Of India, IN
As part of this initiative, resource should research and experiment with the latest AI and cloud innovations (such as AWS Agents, Databricks AI, and other Model Context Protocol (MCP integrations),...Show moreLast updated: 7 days ago
  • Promoted
Platform / Distributed Systems Engineer

Platform / Distributed Systems Engineer

whitetable.aiGurgaon
Description : Job Title : Platform Engineer / Distributed Systems Engineer Location : Full Time, In Office (Gurugram / Benga...Show moreLast updated: 30+ days ago
  • Promoted
Sr Engineer II

Sr Engineer II

AristocratGurgaon, Haryana, India
This job is with Aristocrat, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.Aristocrat le...Show moreLast updated: 13 days ago
  • Promoted
Cloud Systems Engineer

Cloud Systems Engineer

InfogainNoida, Republic Of India, IN
Job Summary : Cloud engineer (6-8 Years).AWS Core Services – EC2, S3, EKS, IAM, VPC.Devops – Jenkins, Github, Unix.Devops Integrations – Sonar, NexusIQ, etc.Show moreLast updated: 16 days ago
  • Promoted
Sr. Engineer I - Systems

Sr. Engineer I - Systems

NewSpace Research and TechnologiesDelhi, India, India
We are a start-up based out of Bengaluru & Delhi NCR.We are engaged in the development of next-generation missions and technologies (NGM&T) for future warfare needs of the Indian Defense forces.It ...Show moreLast updated: 7 days ago
  • Promoted
Sr AI Engineer

Sr AI Engineer

Litmus7delhi, delhi, in
As part of this initiative, resource should research and experiment with the latest AI and cloud innovations (such as AWS Agents, Databricks AI, and other Model Context Protocol (MCP integrations),...Show moreLast updated: 7 days ago
  • Promoted
Sr. System Engineer

Sr. System Engineer

eSecDelhi, India
Ahmedabad, Mumbai, Lucknow, Trivandrum.Candidates from the CCTV / Surveillance, Access Control, Smart / Safe City Projects, IT Networking, and Hardware fields are highly preferred.About the Company eSe...Show moreLast updated: 5 days ago
  • Promoted
Platform Engineer / Distributed Systems Engineer

Platform Engineer / Distributed Systems Engineer

whitetable.aiGurgaon
Description : Job Title : Platform Engineer / Distributed Systems Engineer Location : Full Time, In Office (Gurugram / Bengalu...Show moreLast updated: 14 days ago
  • Promoted
Sr. System Software Engineer

Sr. System Software Engineer

ConfidentialGurgaon / Gurugram, India
Orange Business is a network and digital integrator that understands the entire value chain of the digital world, freeing our customers to focus on the strategic initiatives that shape their busine...Show moreLast updated: 1 day ago
  • Promoted
Senior AI Engineer - LLM & RAG Systems

Senior AI Engineer - LLM & RAG Systems

ConfidentialGurgaon / Gurugram, India
An SF based startup is looking to hire a Senior AI Engineer.Here's What The Day-to-day Responsibilities Will Look Like -. Design and implement conversational AI agents using LLMs.Build RAG pipelines...Show moreLast updated: 11 days ago
  • Promoted
L3 / L4 Infra Support Engineer

L3 / L4 Infra Support Engineer

Consolidated AnalyticsDelhi, India
Job description : L3 / L4 Infra Support Engineer - Windows / Azure.Systems Administrator to join our Systems Team to help design, implement, maintain, and support our growing server infrastructure in th...Show moreLast updated: 4 days ago
  • Promoted
AI System Engineer - Python

AI System Engineer - Python

Terrabase.aiDelhi, IN
Description : Experience : More than 5 years of shipping production AI or machine learning systems and scaling data-intensive back ends.Why This ...Show moreLast updated: 11 days ago
  • Promoted
Linux SME

Linux SME

KyndrylGreater Noida, Uttar Pradesh, India
This job is with Kyndryl, an inclusive employer and a member of myGwork – the largest global platform for the LGBTQ+ business community. Please do not contact the recruiter directly.Who We Are At Ky...Show moreLast updated: 16 days ago