Talent.com
Generative AI Engineer (LLMs & RAG) – Healthcare SaaS

Generative AI Engineer (LLMs & RAG) – Healthcare SaaS

Ekshvaku Tech InnovationsAjmer, IN
3 hours ago
Job description

Generative AI Engineer (LLMs & RAG) – Healthcare SaaS

Location : Remote / Hyderabad (preferred)

Experience : 5+ years

Employment Type : Full-time

Domain : Generative AI, Healthcare, SaaS

About the Role

We are looking for a hands-on Generative AI Engineer with expertise in Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG). In this role, you will architect and deliver production-grade AI systems that combine unstructured and structured healthcare data to power a next-generation clinical assistant.

You will work at the cutting edge of Generative AI and healthcare innovation, helping design safe, scalable, and impactful solutions that directly influence real-world patient outcomes. This is a chance to take ownership of core AI systems in a fast-moving, high-impact product.

Key Responsibilities

  • Architect and optimize RAG pipelines – ingestion, embedding, retrieval, grounding, and generation.
  • Experiment with LLMs – open-weight models (LLaMA, Mistral, Falcon) and API-based models (OpenAI, Anthropic).
  • Implement evaluation & monitoring frameworks (retrieval precision, grounding accuracy, hallucination checks, latency, safety).
  • Work with diverse healthcare datasets – PDFs, EHR / EMR data, CSVs, structured knowledge bases.
  • Deploy AI services to cloud-native environments (AWS, GCP, or Azure) with observability and security in mind.
  • Collaborate cross-functionally with product, clinicians, DevOps, and QA to ship production-ready features.

Must-Have Skills

  • 5+ years of experience in software / ML engineering with strong Python skills (backend + API development).
  • Proven expertise building LLM-powered applications and RAG pipelines.
  • Strong understanding of embeddings, transformers, prompt engineering / orchestration, and NLP techniques.
  • Hands-on experience with vector databases (FAISS, Pinecone, Qdrant, Weaviate, Milvus).
  • Familiarity with LangChain, LLMOps, and ML deployment workflows.
  • Experience with containerized deployments (Docker / Kubernetes) and cloud platforms.
  • Exposure to both open-weight LLMs and hosted APIs.
  • Nice to Have

  • Experience in healthcare / healthtech; knowledge of HIPAA-compliant systems.
  • Familiarity with autonomous agent frameworks (LangGraph, AutoGen, CrewAI).
  • Understanding of clinical NLP ontologies (SNOMED CT, ICD, UMLS).
  • Contributions to open-source AI / ML projects or published proofs-of-concept.
  • Experience building real-time SaaS applications powered by AI.
  • Exposure to multimodal retrieval (structured + unstructured data, including medical images).
  • Soft Skills & Expectations

  • Bias for action : eager to prototype fast while caring about reliability in a healthcare context.
  • Clear communicator who documents clean code and shares progress proactively.
  • Comfortable working independently in an ambiguous, fast-paced startup environment.
  • Flexible for early / late standups with global teams.
  • What We Offer

  • Opportunity to shape one of the first LLM-powered healthcare copilots at scale.
  • Autonomy, trust, and ownership in a results-driven team.
  • Direct mentorship from senior AI researchers and clinicians.
  • Chance to publish technical blogs or papers on healthcare AI.
  • Flexible remote setup with a Hyderabad base.
  • Competitive salary with performance bonuses.
  • Create a job alert for this search

    Generative Ai Engineer • Ajmer, IN