Talent.com
Senior Linux System Administrator
Senior Linux System AdministratorGoodSpace AI • kottayam, kerala, in
No longer accepting applications
Senior Linux System Administrator

Senior Linux System Administrator

GoodSpace AI • kottayam, kerala, in
3 days ago
Job description

Job Title : Linux Infrastructure Engineer – HPC & Cloud

Location : Kurla, Mumbai

Type : Onsite, 5 days a week

Position Overview :

We are seeking a skilled HPC Linux System Administrator to manage and optimize our high-performance computing infrastructure. In this role, you’ll be responsible for deploying, configuring, and maintaining scalable Linux-based HPC systems that power our AI workloads.

You’ll ensure system performance, reliability, and security across compute clusters, storage, and networking. This role is ideal for someone with deep Linux expertise, experience in HPC environments, and a passion for supporting cutting-edge AI research and development.

Key Responsibilities :

  • Linux Systems Administration
  • Install, configure, harden, and maintain Linux systems (RHEL, CentOS, Ubuntu).
  • Manage system upgrades, patch cycles, kernel tuning, and storage configuration.
  • Automation & Provisioning
  • Create and manage infrastructure-as-code (IaC) using Ansible, Terraform, and shell / Python scripts.
  • Provision bare-metal and virtual infrastructure using Foreman, MAAS, or Cobbler.
  • Monitoring & Observability
  • Set up and optimize tools like Prometheus, Grafana, Zabbix, Nagios, or Telegraf.
  • Generate insights into infrastructure and service performance to detect and resolve anomalies proactively.
  • Security & Compliance
  • Enforce security best practices including SELinux, firewalls, and regular vulnerability assessments.
  • Configure secure access controls (LDAP, SSSD, PAM) and audit policies.
  • Containerization & Orchestration
  • Deploy and manage scalable workloads using Docker and Kubernetes.
  • Design CI / CD workflows and infrastructure using Jenkins, GitLab CI, or ArgoCD.
  • GPU & HPC Technologies
  • Configure and optimize GPU clusters using NVIDIA cards and CUDA libraries.
  • Set up GPUDirect RDMA and NVLink for ultra-low latency data transfer in distributed AI / ML environments.
  • HPC / GPU Benchmarking.
  • Tune performance for parallel workloads and manage Slurm or PBS batch schedulers.
  • Virtualization & Cloud Integration
  • Work with KVM, VMware, and Proxmox.
  • Manage hybrid and public cloud infrastructure via AWS, Azure, or Google Cloud.
  • Implement cloud orchestration and auto-scaling infrastructure for compute-intensive workloads.
  • Collaboration & Mentorship
  • Actively collaborate with DevOps, engineering, and research teams to align system design with workload demands.
  • Mentor junior team members and lead knowledge-sharing initiatives.
  • Documentation & Reporting
  • Maintain clear documentation for procedures, system configurations, and architecture diagrams
  • Create reports on uptime, security compliance, system health, and capacity planning.

Requirement & Qualification :

  • Deep expertise in Linux system administration and performance tuning.
  • Strong scripting skills in Bash, Python, or Perl.
  • Solid understanding of TCP / IP, DNS, DHCP, firewalls, and general network principles.
  • Hands-on experience with Ansible, Terraform, or similar tools.
  • Familiarity with Grafana, Prometheus, Zabbix, and log monitoring stacks (e.g., ELK, Loki).
  • Good to have skills :

  • Experience with GPU-accelerated workloads (NVIDIA, CUDA, GPUDirect RDMA).
  • Knowledge of Slurm, PBS, or HPC job schedulers.
  • Background in DevOps practices, including GitOps, CI / CD pipelines, and Infrastructure-as-Code.
  • Prior experience working with large-scale, high-availability systems.
  • Analytical mindset with a knack for debugging complex systems.
  • Excellent communication and mentoring skills.
  • Empathy and patience when dealing with diverse users—tech-savvy or not.
  • Ability to weigh system design trade-offs and make pragmatic choices.
  • Create a job alert for this search

    System Administrator • kottayam, kerala, in

    Related jobs
    System Administrator

    System Administrator

    MGT-COMMERCE GmbH • Alappuzha, IN
    MGT-Commerce is a Berlin-based company founded in 2010 that specializes in providing managed cloud hosting services for Magento e-commerce shops on top of Amazon Web Services (AWS).As an AWS Advanc...Show more
    Last updated: 30+ days ago • Promoted
    T24 System Admin

    T24 System Admin

    Systems Limited • Kottayam, IN
    We are looking for a highly skilled and experienced T24 System Admin to provide technical support and troubleshooting for our T24 COB processes. The successful candidate will be responsible for ensu...Show more
    Last updated: 4 days ago • Promoted
    Lead System Architect

    Lead System Architect

    Pegasystems • Kottayam, IN
    Pegasystems develops strategic applications for sales, marketing, service and operations.Pega's applications streamline critical business operations, connect enterprises to their customers seamless...Show more
    Last updated: 10 days ago • Promoted
    CyberArk SME

    CyberArk SME

    NuSummit Cybersecurity • alappuzha, India
    CyberArk SME – 6+ year, remote.CyberArk CDE certification is Mandatory.CyberArk SaaS implementation and understanding of on-prem components requirements. Onboarding of devices- Kubernetes, Windows, ...Show more
    Last updated: 13 days ago • Promoted
    Linux Engineer

    Linux Engineer

    TerraGiG • alappuzha, kerala, in
    Bachelor's degree in Information Technology, Computer Science or a related field or equivalent practical experience.Proven experience as a Linux architect, systems engineer, or DevOps engineer in e...Show more
    Last updated: 11 days ago • Promoted
    Linux System Administrator (AWS Specialist)

    Linux System Administrator (AWS Specialist)

    MGT-COMMERCE GmbH • Kochi, IN
    Do you live and breathe Linux? Do you enjoy building and managing servers in the cloud?.Linux-focused System Administrator. AWS infrastructure and keep systems running at peak performance.Setting up...Show more
    Last updated: 30+ days ago • Promoted
    Principal IoT Embedded System Architect

    Principal IoT Embedded System Architect

    Faststream Technologies • Kochi, IN
    Own System Architecture & Collaborate with client teams to understand their IoT and Embedded product requirements.Investigate, analyze, review, and enhance functionality and modules for existing & ...Show more
    Last updated: 30+ days ago • Promoted
    Senior AppDynamics Observability SME

    Senior AppDynamics Observability SME

    Dexian India • Kochi, IN
    Position Title : Senior AppDynamics Observability SME.IT operations, system administration, or engineering.Ansible, Jenkins, Terraform, Python to develop configuration, deployment, and orchestration...Show more
    Last updated: 23 days ago • Promoted
    Windows Administrator

    Windows Administrator

    Celestica • Kottayam, IN
    Manage and support Intel / VMware / Windows servers / operating systems and Active Directory that constitute the Celestica global enterprise server environment. These duties include Vmware and Windows Ser...Show more
    Last updated: 13 days ago • Promoted
    AutoSys Administrator

    AutoSys Administrator

    Atyeti Inc • Kochi, IN
    Install and configure AutoSys components (AE, EEM, WCC, Agents).Perform upgrades and patching of AutoSys environments.Create, modify, and manage job definitions using JIL and WCC.Schedule and monit...Show more
    Last updated: 6 days ago • Promoted
    Senior Engineer

    Senior Engineer

    Ignitarium • Kochi, Kerala, India
    Experienced with Sequans LTE-M / NB-IoT modems,.Show more
    Last updated: 12 days ago • Promoted
    Senior Kubernetes Network Engineer

    Senior Kubernetes Network Engineer

    World Wide Technology • Alappuzha, IN
    World Wide Technology Holding Co, LLC (WWT).Through our culture of innovation, we inspire, build and deliver business results, from idea to outcome. Louis, WWT works closely with industry leaders su...Show more
    Last updated: 13 days ago • Promoted
    Firmware Engineer - Embedded System

    Firmware Engineer - Embedded System

    Xped pvt Ltd • Kerala
    Job Description : - Responsible for developing and supporting zPDT features using C and C++ on Linux environments.Collaborates with Archite...Show more
    Last updated: 30+ days ago • Promoted
    Senior Cloud & Systems Administrator

    Senior Cloud & Systems Administrator

    Confidential • India, Cochin / Kochi / Ernakulam
    System Administrator – Job Description.The Senior System Administrator will be responsible for managing our Azure cloud infrastructure, Office . The ideal candidate should have a deep understanding ...Show more
    Last updated: 7 days ago • Promoted
    Aix System Administrator

    Aix System Administrator

    Tata Consultancy Services • Kottayam, IN
    Come and join us for an exciting career with TCS!!!.Must Have Experiences and Skills : .As this is for a L2 requirement, candidates should have strong skills in installation, configuration, administr...Show more
    Last updated: 13 days ago • Promoted
    Technical Advisor - Kernel Networking

    Technical Advisor - Kernel Networking

    WatchGuard Technologies • Kottayam, IN
    Core skills required : Linux Kernel, Network device driver development, Linux internals, Networking stack.Good to have : Data plane development kit (DPDK) and Vector Packet Processor (VPP).You are a...Show more
    Last updated: 22 days ago • Promoted
    Middleware Administrator

    Middleware Administrator

    Capgemini • Kottayam, IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 3 days ago • Promoted
    Lead Software Engineer( Linux Kernel Developer )

    Lead Software Engineer( Linux Kernel Developer )

    DDN • Alappuzha, IN
    This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a globa...Show more
    Last updated: 6 days ago • Promoted