Talent.com
Senior Linux System Administrator
Senior Linux System AdministratorGoodSpace AI • thiruvananthapuram, kerala, in
No longer accepting applications
Senior Linux System Administrator

Senior Linux System Administrator

GoodSpace AI • thiruvananthapuram, kerala, in
2 days ago
Job description

Job Title : Linux Infrastructure Engineer – HPC & Cloud

Location : Kurla, Mumbai

Type : Onsite, 5 days a week

Position Overview :

We are seeking a skilled HPC Linux System Administrator to manage and optimize our high-performance computing infrastructure. In this role, you’ll be responsible for deploying, configuring, and maintaining scalable Linux-based HPC systems that power our AI workloads.

You’ll ensure system performance, reliability, and security across compute clusters, storage, and networking. This role is ideal for someone with deep Linux expertise, experience in HPC environments, and a passion for supporting cutting-edge AI research and development.

Key Responsibilities :

  • Linux Systems Administration
  • Install, configure, harden, and maintain Linux systems (RHEL, CentOS, Ubuntu).
  • Manage system upgrades, patch cycles, kernel tuning, and storage configuration.
  • Automation & Provisioning
  • Create and manage infrastructure-as-code (IaC) using Ansible, Terraform, and shell / Python scripts.
  • Provision bare-metal and virtual infrastructure using Foreman, MAAS, or Cobbler.
  • Monitoring & Observability
  • Set up and optimize tools like Prometheus, Grafana, Zabbix, Nagios, or Telegraf.
  • Generate insights into infrastructure and service performance to detect and resolve anomalies proactively.
  • Security & Compliance
  • Enforce security best practices including SELinux, firewalls, and regular vulnerability assessments.
  • Configure secure access controls (LDAP, SSSD, PAM) and audit policies.
  • Containerization & Orchestration
  • Deploy and manage scalable workloads using Docker and Kubernetes.
  • Design CI / CD workflows and infrastructure using Jenkins, GitLab CI, or ArgoCD.
  • GPU & HPC Technologies
  • Configure and optimize GPU clusters using NVIDIA cards and CUDA libraries.
  • Set up GPUDirect RDMA and NVLink for ultra-low latency data transfer in distributed AI / ML environments.
  • HPC / GPU Benchmarking.
  • Tune performance for parallel workloads and manage Slurm or PBS batch schedulers.
  • Virtualization & Cloud Integration
  • Work with KVM, VMware, and Proxmox.
  • Manage hybrid and public cloud infrastructure via AWS, Azure, or Google Cloud.
  • Implement cloud orchestration and auto-scaling infrastructure for compute-intensive workloads.
  • Collaboration & Mentorship
  • Actively collaborate with DevOps, engineering, and research teams to align system design with workload demands.
  • Mentor junior team members and lead knowledge-sharing initiatives.
  • Documentation & Reporting
  • Maintain clear documentation for procedures, system configurations, and architecture diagrams
  • Create reports on uptime, security compliance, system health, and capacity planning.

Requirement & Qualification :

  • Deep expertise in Linux system administration and performance tuning.
  • Strong scripting skills in Bash, Python, or Perl.
  • Solid understanding of TCP / IP, DNS, DHCP, firewalls, and general network principles.
  • Hands-on experience with Ansible, Terraform, or similar tools.
  • Familiarity with Grafana, Prometheus, Zabbix, and log monitoring stacks (e.g., ELK, Loki).
  • Good to have skills :

  • Experience with GPU-accelerated workloads (NVIDIA, CUDA, GPUDirect RDMA).
  • Knowledge of Slurm, PBS, or HPC job schedulers.
  • Background in DevOps practices, including GitOps, CI / CD pipelines, and Infrastructure-as-Code.
  • Prior experience working with large-scale, high-availability systems.
  • Analytical mindset with a knack for debugging complex systems.
  • Excellent communication and mentoring skills.
  • Empathy and patience when dealing with diverse users—tech-savvy or not.
  • Ability to weigh system design trade-offs and make pragmatic choices.
  • Create a job alert for this search

    System Administrator • thiruvananthapuram, kerala, in

    Related jobs
    AutoSys Administrator

    AutoSys Administrator

    Atyeti Inc • Thiruvananthapuram, IN
    Install and configure AutoSys components (AE, EEM, WCC, Agents).Perform upgrades and patching of AutoSys environments.Create, modify, and manage job definitions using JIL and WCC.Schedule and monit...Show more
    Last updated: 5 days ago • Promoted
    T24 System Admin

    T24 System Admin

    Systems Limited • Thiruvananthapuram, IN
    We are looking for a highly skilled and experienced T24 System Admin to provide technical support and troubleshooting for our T24 COB processes. The successful candidate will be responsible for ensu...Show more
    Last updated: 3 days ago • Promoted
    Windows Administrator

    Windows Administrator

    Celestica • kollam, kerala, in
    Manage and support Intel / VMware / Windows servers / operating systems and Active Directory that constitute the Celestica global enterprise server environment. These duties include Vmware and Windows Ser...Show more
    Last updated: 12 days ago • Promoted
    Linux Administrator

    Linux Administrator

    Alliance Recruitment Agency • Thiruvananthapuram, IN
    Role : - Senior Infrastructure / Linux Administrator.Exp : -8+ yrs ( Very fluent in speaking accent neutral English).Senior Infrastructure / Linux Administrator with a lot of experience in the IT sector.H...Show more
    Last updated: 3 days ago • Promoted
    Apigee, GCP(Linux+Networking)_Chennai, Bangalore, Gandhinagar

    Apigee, GCP(Linux+Networking)_Chennai, Bangalore, Gandhinagar

    Tata Consultancy Services • Thiruvananthapuram, IN
    TCS Hiring for Apigee, GCP(Linux+Networking).Job Location : Chennai, Bangalore, Gandhinagar.APIGEE, GCP, Linux, Apigee OPDK, Associate Cloud Engineer. Must Have : Associate cloud Engineer Certified.Ha...Show more
    Last updated: 19 hours ago • Promoted • New!
    System Administrator

    System Administrator

    MGT-COMMERCE GmbH • Kollam, IN
    MGT-Commerce is a Berlin-based company founded in 2010 that specializes in providing managed cloud hosting services for Magento e-commerce shops on top of Amazon Web Services (AWS).As an AWS Advanc...Show more
    Last updated: 30+ days ago • Promoted
    Aix System Administrator

    Aix System Administrator

    Tata Consultancy Services • Kollam, IN
    Come and join us for an exciting career with TCS!!!.Must Have Experiences and Skills : .As this is for a L2 requirement, candidates should have strong skills in installation, configuration, administr...Show more
    Last updated: 12 days ago • Promoted
    System Support Engineer

    System Support Engineer

    Soffit Infrastructure Services (P) Ltd • Trivandrum, Kerala, India
    Soffit is seeking a dedicated and qualified.The selected candidate will ensure high system availability, reliable service delivery, and optimized performance. The role requires hands-on experience w...Show more
    Last updated: 30+ days ago • Promoted
    System Administrator - Azure Cloud Infrastructure

    System Administrator - Azure Cloud Infrastructure

    PromptTech Global • Thiruvananthapuram
    We are seeking a skilled System Administrator with expertise in Azure cloud environments, Linux administration, and Nginx web servers. The ideal candidate will be responsible for managing and mainta...Show more
    Last updated: 30+ days ago • Promoted
    Senior Windows System Administrator

    Senior Windows System Administrator

    Clustrex Data Private Limited • kollam, India
    Job Title : Windows System Administrator.Must have partial overlap with 8 AM – 5 PM EST ( 6 : 30 pm- 3 : 30 am IST).We are looking for an experienced Windows System Administrator to support, manage, and...Show more
    Last updated: 1 day ago • Promoted
    Lead System Architect

    Lead System Architect

    Pegasystems • Thiruvananthapuram, IN
    Pegasystems develops strategic applications for sales, marketing, service and operations.Pega's applications streamline critical business operations, connect enterprises to their customers seamless...Show more
    Last updated: 9 days ago • Promoted
    Principal IoT Embedded System Architect

    Principal IoT Embedded System Architect

    Faststream Technologies • Kollam, IN
    Own System Architecture & Collaborate with client teams to understand their IoT and Embedded product requirements.Investigate, analyze, review, and enhance functionality and modules for existing & ...Show more
    Last updated: 30+ days ago • Promoted
    Linux System Administrator

    Linux System Administrator

    Confidential • Thiruvananthapuram, Thiruvananthapuram / Trivandrum, India
    Years experience in LAMP, Min 2 years' experience in Automation.The System Administrator will manage and maintain the LAMP (Linux, Apache, MySQL, PHP) platform, ensuring optimal performance and sta...Show more
    Last updated: 16 days ago • Promoted
    Linux System Administrator (AWS Specialist)

    Linux System Administrator (AWS Specialist)

    MGT-COMMERCE GmbH • Kollam, IN
    Do you live and breathe Linux? Do you enjoy building and managing servers in the cloud?.Linux-focused System Administrator. AWS infrastructure and keep systems running at peak performance.Setting up...Show more
    Last updated: 30+ days ago • Promoted
    Senior Windows System Administrator

    Senior Windows System Administrator

    stackway • Thiruvananthapuram, IN
    System Administrator | 4+ Years Experience | Windows Server, AWS, SQL.At stackway, we believe software development is best done in small, highly functional teams with good collaboration, cooperatio...Show more
    Last updated: 19 hours ago • Promoted • New!
    System Engineer L2 Linux Kubernetes

    System Engineer L2 Linux Kubernetes

    SpeedMart • Thiruvananthapuram, IN
    Our client is a global IT services company that helps businesses with digital transformation with offices in India and the United States. It helps businesses with digital transformation, provide IT ...Show more
    Last updated: 19 hours ago • Promoted • New!
    Technical Advisor - Kernel Networking

    Technical Advisor - Kernel Networking

    WatchGuard Technologies • Kollam, IN
    Core skills required : Linux Kernel, Network device driver development, Linux internals, Networking stack.Good to have : Data plane development kit (DPDK) and Vector Packet Processor (VPP).You are a...Show more
    Last updated: 21 days ago • Promoted
    Linux Engineer

    Linux Engineer

    TerraGiG • kollam, kerala, in
    Bachelor's degree in Information Technology, Computer Science or a related field or equivalent practical experience.Proven experience as a Linux architect, systems engineer, or DevOps engineer in e...Show more
    Last updated: 10 days ago • Promoted