Job Title : AI Infrastructure & Performance Engineer
Experience : 4 to 6 Years
Location : Remote
Job Type : Full-time / Contract
Joiners : Immediate or short notice preferred
AI Infrastructure & Performance Engineer (1 Position) Primary Responsibilities Setup and manage model deployment infrastructure
Optimize inference speed and resource utilization
Monitor and scale AI services
Implement security for model deployments
Manage costs and optimize compute resource usage
Detailed Skillset Model Serving : Ray Serve, vLLM, optimized inference engines
Infrastructure : Kubernetes, Docker, cloud deployments (AWS / Azure / GCP)
Performance Optimization : Quantization, caching, batch processing
Monitoring : Prometheus, Grafana
Security : Secure model serving, access control, input validation
DevOps : CI / CD pipelines, automated scaling
Cost Management : Resource tracking and optimization
Tools & Technologies Kubernetes, Docker, Helm
Ray Serve, vLLM
Prometheus, Grafana
AWS / Azure / GCP AI services
Security scanning and access management tools
Performance Engineer • Ajmer, Rajasthan, India