Talent.com
Site Reliability Engineer

ACL Digital • India
9 hours ago
Job description

Position : SRE & DevOps (ML Framework / Ray.io / NodeJS / GO)

Location : Bangalore (Onsite)

Type of Hire : Contract

Duration : 1 year

SRE & DevOps (ML Framework)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python.
  • Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton.
  • Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments.
  • Experience with AI / ML model training and inferencing platforms is a big plus.
  • Experience with LLM fine-tuning systems is a big plus.

SRE & DevOps (Ray.io)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python and C++.
  • Hands-on experience with Ray.io, including workload management, cluster deployment, distributed task scheduling, and troubleshooting.
  • Ability to use Ray Dashboard and CLI tools for monitoring, resource tracking, debugging distributed jobs, and resolving production issues.
  • Knowledge of Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data is a big plus.
  • Experience integrating Ray with tools such as Airflow, MLflow, Dask, and DeepSpeed is a big plus.

SRE & DevOps (NodeJS)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in JavaScript.
  • Proficient in Node.js and web front-end / UI development.
  • Experience with Jupyter notebooks.
  • Ability to integrate backend services with interactive UI components for developer productivity and ML workflow usability.
  • Experience with Visual Studio Code plugin development is a big plus.

SRE & DevOps (GO)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Go.
  • Hands-on experience with Kubernetes plugin / operator / CRD development.
  • Experience with KubeRay is a big plus.
  • Experience with Kubernetes machine learning projects like Kubeflow is a big plus.
  • Experience with the Kubernetes scheduler is a big plus.

Regards,

Harshit Garg

Team - Talent Acquisition

ALTEN Calsoft Labs

2890 Zanker Road, Suite 200, San Jose, CA 95134

