Talent.com
No longer accepting applications
Ai Infrastructure & Performance Engineer

Ai Infrastructure & Performance Engineer

NowWiN InternationalKottayam, Republic Of India, IN
2 days ago
Job description

Job Title : AI Infrastructure & Performance Engineer

Experience : 4 to 6 Years

Location : Remote

Job Type : Full-time / Contract

Joiners : Immediate or short notice preferred

AI Infrastructure & Performance Engineer (1 Position) Primary Responsibilities

  • Setup and manage model deployment infrastructure
  • Optimize inference speed and resource utilization
  • Monitor and scale AI services
  • Implement security for model deployments
  • Manage costs and optimize compute resource usage

Detailed Skillset

  • Model Serving : Ray Serve, vLLM, optimized inference engines
  • Infrastructure : Kubernetes, Docker, cloud deployments (AWS / Azure / GCP)
  • Performance Optimization : Quantization, caching, batch processing
  • Monitoring : Prometheus, Grafana
  • Security : Secure model serving, access control, input validation
  • DevOps : CI / CD pipelines, automated scaling
  • Cost Management : Resource tracking and optimization
  • Tools & Technologies

  • Kubernetes, Docker, Helm
  • Ray Serve, vLLM
  • Prometheus, Grafana
  • AWS / Azure / GCP AI services
  • Security scanning and access management tools
  • Create a job alert for this search

    Performance Engineer • Kottayam, Republic Of India, IN