Talent.com
Sr Systems Engineer Linux – AI Infrastructure

DC Tech Consulting
Erode, Republic Of India, IN
1 day ago
Job description

Position: Senior Linux Administrator – AI/ML Infrastructure

Location: Remote

Experience: 5+ Years

Type: Full-time

Role Overview

We are seeking a highly skilled Senior Linux Administrator to join our team, focusing on the implementation and management of on-premises Linux servers optimized for AI/ML workloads.

The ideal candidate will have deep expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center infrastructure components including servers, networking, storage, and virtualization technologies.

This role requires hands-on experience in automating infrastructure, optimizing performance, and ensuring reliability for high-performance computing (HPC) and AI/ML pipelines.

Key Responsibilities

Deploy, configure, and manage on-premises Linux servers supporting AI/ML workloads.

Set up, manage, and troubleshoot Kubernetes clusters for containerized workloads.

Optimize system and network performance for compute-intensive applications.

Automate provisioning and configuration using Ansible, Terraform, and scripting (Bash/Python).

Administer and monitor data center components such as servers, storage arrays, switches, and power systems.

Ensure system security, patch management, and compliance across environments.

Collaborate with DevOps, Data Science, and AI engineering teams to enable seamless integration with ML pipelines.

Plan and implement scalability strategies, maintaining uptime and redundancy.

Maintain comprehensive documentation of configurations, policies, and network diagrams.

Required Skills & Qualifications

7+ years of experience in Linux system administration (RHEL, Ubuntu, CentOS).

Proven hands-on experience with Kubernetes cluster management (setup, scaling, troubleshooting).

CKA (Certified Kubernetes Administrator) certification is mandatory.

Strong knowledge of data center components – servers, racks, networking switches, storage systems, and virtualization layers.

Experience with Ansible, Terraform, CI/CD pipelines, and infrastructure automation.

Proficiency in scripting languages (Bash, Python).

Understanding of performance tuning, system optimization, and fault diagnosis.

Excellent problem-solving, communication, and collaboration skills.

Preferred / Good to Have

Exposure to NVIDIA GPU management, CUDA environments, and AI/ML compute nodes.

Familiarity with HPC environments and distributed computing frameworks.

Experience managing monitoring systems (Prometheus, Grafana) and backup solutions.

Knowledge of DevOps practices, containerization, and hybrid cloud environments.
