Talent.com
Application Reliability Engineer

Application Reliability Engineer

AMISEQBengaluru, Republic Of India, IN
2 days ago
Job description

SRE & DevOps (Ray.Io)

Bengaluru, KA

Experience - 3-6 years

Job Functions :

You will be a member of our AI Platform Team, supporting the next generation AI architecture for various research and engineering teams within the organization.

  • You'll partner with vendors and the infrastructure engineering team for security and service availability
  • You'll fix production issues with engineering teams, researchers, data scientists, including performance and functional issues
  • Diagnose and solve customer technical problems
  • Participate in training customers and prepare reports on customer issues
  • Be responsible for customer service improvements and recommend product improvements
  • Write support documentation
  • You'll design and implement zero-downtime to monitor and accomplish a highly available service (99.999%)

Required Skills :

  • Demonstrated ability in designing, building, refactoring and releasing software written in Python, C++.
  • Hands-on experience with Ray.Io, including workload management, cluster deployment, distributed task scheduling, and troubleshooting.
  • Ability to use Ray Dashboard and CLI tools for monitoring, resource tracking, debugging distributed jobs, and resolving production issues.
  • Having knowledge of Ray ecosystem libraries such as Ray Train, Ray Tune, Ray Serve, and Ray Data is a big plus.
  • Experience integrating Ray with tools such as Airflow, MLflow, Dask, DeepSpeed is a big plus.
  • Debugging and triaging skills.
  • Cloud technologies like Kubernetes, Docker and Linux fundamentals.
  • Familiar with DevOps practices and continuous testing.
  • DevOps pipeline and automations : app deployment / configuration & performance monitoring.
  • Test automations, Jenkins CI / CD.
  • Excellent communication, presentation, and leadership skills to be able to work and collaborate with partners, customers and engineering teams.
  • Well organized and able to manage multiple projects in a fast paced and demanding environment.
  • Good oral / reading / writing English ability.
  • Create a job alert for this search

    Application Engineer • Bengaluru, Republic Of India, IN