Talent.com
Site Reliability Engineer

ACL Digital
Bengaluru, Karnataka, India
30+ days ago
Job description

Position : SRE & DevOps (ML Framework / Ray.io / Node.js / Go)

Location : Bangalore (Onsite)

Type of Hire : Contract

Duration : 1 year

SRE & DevOps (ML Framework)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python.
  • Hands-on experience with ML frameworks such as PyTorch, TensorFlow, Triton.
  • Ability to handle framework-related issues, version upgrades, and compatibility with data processing / model training environments.
  • Experience with AI / ML model training and inferencing platforms is a big plus.
  • Experience with LLM fine-tuning systems is a big plus.

SRE & DevOps (Ray.io)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python and C++.
  • Hands-on experience with Ray.io, including workload management, cluster deployment, distributed task scheduling, and troubleshooting.
  • Ability to use Ray Dashboard and CLI tools for monitoring, resource tracking, debugging distributed jobs, and resolving production issues.
  • Knowledge of Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data is a big plus.
  • Experience integrating Ray with tools such as Airflow, MLflow, Dask, and DeepSpeed is a big plus.

SRE & DevOps (Node.js)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in JavaScript.
  • Proficient in Node.js and web front-end / UI development.
  • Experience with Jupyter notebooks.
  • Ability to integrate backend services with interactive UI components for developer productivity and ML workflow usability.
  • Experience with Visual Studio Code plugin development is a big plus.

SRE & DevOps (Go)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Go.
  • Hands-on experience with Kubernetes plugin / operator / CRD development.
  • Experience with KubeRay is a big plus.
  • Experience with Kubernetes machine learning projects like Kubeflow is a big plus.
  • Experience with the Kubernetes scheduler is a big plus.

Regards,

Harshit Garg

Team - Talent Acquisition

ALTEN Calsoft Labs

2890 Zanker Road, Suite 200, San Jose, CA 95134

Phone : +1 408-755-3060

Email : Harshit.g@acldigital.com
