Site Reliability Engineer
No longer accepting applications
ACL Digital • Bangalore, Karnataka, IN
30+ days ago
Job description

Position : SRE & DevOps (ML Framework / Ray.io / Node.js / Go)

Location : Bangalore (Onsite)

Type of Hire : Contract

Duration : 1 year

SRE & DevOps (ML Framework)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python.
  • Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton.
  • Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments.
  • Experience with AI / ML model training and inferencing platforms is a big plus.
  • Experience with LLM fine-tuning systems is a big plus.

SRE & DevOps (Ray.io)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python and C++.
  • Hands-on experience with Ray.io, including workload management, cluster deployment, distributed task scheduling, and troubleshooting.
  • Ability to use Ray Dashboard and CLI tools for monitoring, resource tracking, debugging distributed jobs, and resolving production issues.
  • Knowledge of Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data is a big plus.
  • Experience integrating Ray with tools such as Airflow, MLflow, Dask, DeepSpeed is a big plus.

SRE & DevOps (Node.js)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in JavaScript.
  • Proficient in Node.js and web front-end / UI development.
  • Experience with Jupyter notebooks.
  • Ability to integrate backend services with interactive UI components for developer productivity and ML workflow usability.
  • Experience with Visual Studio Code plugin development is a big plus.

SRE & DevOps (Go)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Go.
  • Hands-on experience with Kubernetes plugin / operator / CRD development.
  • Experience with KubeRay is a big plus.
  • Experience with Kubernetes machine learning projects such as Kubeflow is a big plus.
  • Experience with the Kubernetes scheduler is a big plus.

Regards,

Harshit Garg

Team – Talent Acquisition

ALTEN Calsoft Labs

2890 Zanker Road, Suite 200, San Jose, CA 95134

Phone : +1 408-755-3060

Email : Harshit.g@acldigital.com
