Description : About the Role :
We are seeking a passionate and highly skilled AI / ML Engineer with hands-on experience in Generative AI, Predictive Modeling, and API development.
The ideal candidate will design, develop, and deploy intelligent systems and scalable architecturesboth monolithic and microservices-basedthat integrate advanced AI capabilities into production-ready applications.
Key Responsibilities :
- Design and implement AI / ML solutions for generative and predictive use cases using state-of-the-art models and frameworks.
- Develop, train, fine-tune, and optimize machine learning models, including LLMs, transformers, and forecasting models.
- Build and deploy RESTful APIs using FastAPI or Flask to serve AI / ML models efficiently.
- Containerize applications using Docker and ensure smooth deployment across environments.
- Collaborate with data engineers and backend teams to design scalable architecturesmonolithic and microservice-basedfor AI pipelines and APIs.
- Integrate APIs and models with frontend systems and ensure performance, security, and maintainability.
- Conduct model evaluation, A / B testing, and continuous improvement of deployed AI systems.
- Use tools like Postman for API testing and documentation.
- Monitor production systems, identify bottlenecks, and optimize for performance and scalability.
Required Skills & Qualifications :
Bachelors or masters degree in computer science, AI / ML, Data Science, or related field.3+ years of experience in developing and deploying machine learning or AI solutions.Proven expertise in Generative AI (LLMs, diffusion models, or embeddings) and Predictive AI (forecasting, regression, classification).Strong proficiency in Python and popular AI / ML frameworks such as PyTorch, TensorFlow, scikit-learn, or Hugging Face Transformers.Hands-on experience with FastAPI or Flask for model deployment and API design.Experience with Docker and container-based deployments.Understanding of software architecture principles, including monolithic and microservices design patterns.Strong knowledge of REST APIs, Postman, and version control systems (Git).Familiarity with cloud platforms (AWS, GCP, or Azure) and CI / CD pipelines is a plus.Excellent problem-solving, debugging, and collaboration skills.Experience with vector databases, LLM fine-tuning, or retrieval-augmented generation (RAG).Exposure to data pipelines, message queues, and asynchronous architectures.Prior experience building AI-powered APIs for production-scale systems.(ref : hirist.tech)