Description:
As an AI Engineer-II, you will play a critical role in designing, developing, and scaling GenAI-powered solutions for healthcare. You will work across the stack: model design, RAG pipelines, deployment, monitoring, and cost optimization, ensuring our AI systems are reliable, scalable, and :
- 3+ years of hands-on experience with LLMs / GenAI (e.g., GPT, Claude, Llama, PaLM).
- 5+ years in Data Science / Applied ML, with production deployments.
- Strong expertise in RAG architectures, vector databases (Pinecone, Weaviate, Chroma), and semantic search.
- Proficiency in Python and GenAI libraries (LangChain, LlamaIndex, HuggingFace, OpenAI / Anthropic APIs).
- Experience with production GenAI deployments, including monitoring, scaling, cost optimization, and MLOps practices.
Nice-to-Have (Preferred):
- Research publications or contributions in NLP / GenAI.
- Experience with clinical decision support systems or healthcare AI.
- Knowledge of distributed training / model parallelization.
- Exposure to healthcare NLP (medical reports, ICD-10, CPT, SNOMED).
Not a Fit If:
- You only have pure research experience with no production deployments.
- Your background is limited to rule-based NLP (without GenAI expertise).
- You lack a systems-thinking / cost-optimization mindset for scaling AI in production.
(ref: hirist.tech)