AI Engineer - Job Description :
We are looking for a skilled AI Engineer to develop, implement, and optimize AI solutions using cutting-edge generative AI technologies.
This role focuses on hands-on development of AI applications, model integration, and production deployment while working closely with our AI architecture team.
Key Responsibilities :
AI Application Development :
- Build and maintain AI-powered applications using LLM APIs (OpenAI, Claude, Gemini, etc.)
- Implement prompt engineering strategies and optimize model interactions for performance and cost
- Develop custom AI workflows including retrieval-augmented generation (RAG) systems
- Create robust error handling, fallback mechanisms, and response validation systems
Model Implementation & Fine-tuning :
Fine-tune open-source models (Llama, Mistral, etc.) for specific use cases and domainsImplement training pipelines using modern frameworks (PyTorch, Hugging Face Transformers)Conduct model evaluation, A / B testing, and performance optimizationManage model versioning and experiment trackingProduction Deployment & Operations :
Deploy AI models and applications using FastAPI, Docker, and KubernetesBuild scalable microservices architecture for AI applicationsImplement monitoring, logging, and alerting for production AI systemsOptimize inference performance, latency, and resource utilizationIntegration & Data Engineering :
Integrate AI capabilities into existing systems and workflowsBuild data pipelines for model training and inferenceWork with vector databases and embedding systemsImplement caching strategies and data preprocessing pipelinesRequired Qualifications :
Experience :
3+ years of experience in machine learning or software engineering1+ years of hands-on experience with generative AI and LLM integrationDemonstrated experience deploying ML models in production environmentsTechnical Skills :
Proficiency with LLM APIs and SDKs (OpenAI, Anthropic, Google, etc.)Experience fine-tuning transformer models using Hugging Face, PyTorch, or similarStrong proficiency in Python and modern software development practicesHands-on experience with FastAPI, Docker, and containerizationExperience with Kubernetes for container orchestrationKnowledge of RESTful API design and microservices architectureCore Competencies :
Understanding of transformer architectures, embeddings, and attention mechanismsExperience with prompt engineering and model optimization techniquesFamiliarity with MLOps practices and tools (model versioning, monitoring, CI / CD)Strong debugging and troubleshooting skillsAbility to work with large-scale data and distributed systems.Preferred Qualifications :
Experience with vector databases (Pinecone, Weviate, Chroma, etc.)Knowledge of retrieval-augmented generation (RAG) implementationExperience with model quantization and optimization techniquesFamiliarity with cloud platforms (AWS, GCP, Azure) and their AI servicesExperience with streaming and real-time AI applicationsKnowledge of AI safety and responsible AI practicesExperience with specific domains (NLP, computer vision, etc.)If your Interested Please Share your updated CV to style="font-weight : bold;">
Note :
Only Healthcare / Legal / Finance Domains will be Considered.
(ref : hirist.tech)