Talent.com
SRE & DevOps Engineer (Ray.io)

Confidential, India
7 days ago
Job description

N-iX is a global software development service company that helps businesses across the world develop successful software products. Founded in 2002, N-iX has come a long way, expanding its presence across Europe, the US, and Latin America. Today, we are a strong community of 2,000+ professionals and a reliable partner for global industry leaders and Fortune 500 companies.

Our client is a global commerce leader where you can influence how the world buys, sells, and gives. You'll be part of a work culture that's been genuinely committed to diversity and inclusion since its founding over twenty-five years ago. Here, you can be yourself, do your best work along with a team of professionals, and have a meaningful impact on people across the globe. We seek people with drive, ideas, and a passion for helping small businesses succeed.

About the team: You will join the AI Platform Team, providing highly available, scalable, and automated machine learning infrastructure for researchers and data scientists globally. We are looking for a motivated, self-reliant SRE / DevOps engineer with Python and C++ experience to drive operational excellence, automation, and platform reliability, with a focus on Ray.io.

About the role: This role focuses on maintaining, deploying, and improving AI / ML platform services using Ray.io, with strong emphasis on DevOps, SRE practices, and automation. You will collaborate closely with developers, researchers, and infrastructure teams to ensure robust, scalable, and highly available distributed ML systems.

Responsibilities:

DevOps tasks (60%):

  • Design, implement, and maintain CI / CD pipelines for AI / ML platform services.
  • Manage and troubleshoot Kubernetes clusters, Docker containers, and cloud infrastructure.
  • Ensure high availability (99.999%), system reliability, and security across platforms.
  • Automate operational tasks, monitoring, and deployment workflows.
  • Deploy and maintain Ray.io clusters, ensuring workload scheduling and distributed job reliability.
  • Monitor production systems via Ray Dashboard and CLI tools, and integrate alerting and metrics.
  • Analyze and resolve production issues, performance bottlenecks, and functional problems.
  • Define operational standards and versioning practices, and advise teams on DevOps best practices.
  • Prepare documentation and training materials, and provide technical support to platform users.

Development tasks (40%):

  • Design, build, and refactor Python and C++ services for Ray.io workflows.
  • Work with Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data.
  • Integrate Ray with tools such as Airflow, MLflow, Dask, and DeepSpeed (a plus).
  • Collaborate with developers to integrate distributed ML pipelines into automated CI / CD workflows.

Requirements:

  • Strong Python and C++ development experience (2–4 years).
  • Hands-on experience with Ray.io: cluster deployment, workload management, distributed task scheduling.
  • Familiarity with Ray ecosystem libraries (Train, Tune, Serve, Data) and integration with ML tooling.
  • Solid understanding of Kubernetes, Docker, Linux fundamentals, and DevOps practices.
  • Experience with CI / CD pipelines (Jenkins or similar), test automation, and monitoring.
  • Strong debugging and triaging skills for distributed systems.
  • Excellent communication and collaboration skills with cross-functional teams.
  • Strong organizational skills to manage multiple projects in a fast-paced environment.
  • Fluent in English (spoken and written).
  • Overall 3–5 years of relevant DevOps / SRE experience.

We offer:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
*Not applicable for freelancers.

Skills required:

Airflow, Jenkins, Docker, Linux, Kubernetes, Python
