Talent.com
This job offer is not available in your country.
Senior CV / LLM Engineer

Senior CV / LLM Engineer

doAZThiruvananthapuram, IN
4 hours ago
Job description

Job description

About Doaz

  • Doaz is a hyper-growth startup on a mission to turn fragmented industrial knowledge into instant, actionable insight. We build LLM- and Vision-AI solutions for construction, heavy industry, and finance leaders who must transform terabytes of drawings, specifications, and regulations into real-time decisions.
  • We’re expanding our GeoAI programs (including joint work with POSCO E&C) and launching drawing-change detection services that automatically compare plan versions, detect deltas, and explain design impacts.

Why You’ll Love Working Here

  • True 0-to-1 ownership — Ship models that land in production sites within weeks.
  • Global impact, lean crew — 30 teammates across KR🇰🇷 / PK🇵🇰 / IN🇮🇳; no bureaucracy, only builders.
  • Tech freedom — YOLO or RT-DETR? Gemma-VL, Qwen-VL, or LLaVA? You choose, we fund.
  • Role Overview

    We’re hiring a Senior Computer Vision & Multimodal LLM Engineer (GeoAI & Drawing Change Analysis).

    You’ll lead end-to-end development of a version-aware drawing-diff engine (PDF / DWG raster & vector), symbol / text extraction, and change-impact narratives powered by RAG / LLM. Expect fast cycles from prototype → service : detection models, OCR / layout understanding, retrieval, and explainable outputs that engineers can trust.

    Key Responsibilities

    Drawing Change Analysis (CV)

  • Build a robust diff pipeline for architectural / structural / MEP drawings : rasterization, layer parsing, vector geometry ops, and semantic change clustering.
  • Train / finetune detectors & segmenters (e.g., YOLOv8 / RT-DETR / Detectron2 / SAM) for symbols (columns, openings, sleeves), title blocks, and revision clouds; achieve production-grade mAP / F1.
  • Implement geometry-aware post-processing (IoU / topology checks, snapping, graph connectivity) to reduce false positives.
  • Document & Layout Understanding

  • Engineer OCR + layout models (PaddleOCR / Tesseract + DocFormer / LayoutLMv3 / Donut) to read legends, notes, schedules, and BOM tables; normalize to structured JSON.
  • Build version-aware entity tracking (IDs, gridlines, BH IDs, coordinates) across revisions.
  • GeoAI & LLM / RAG

  • Design retrieval over drawings / specs (BM25 + vector) with reranking; ground LLM answers in evidence with citations and clickable locations.
  • Generate change-impact summaries (e.g., slab shear reinforcement, opening proximity to columns) with rules + LLM verification; measure factual precision.
  • Productization & DevOps

  • Ship FastAPI / gRPC microservices, batch & streaming workers (Ray / Celery), GPU inference (Triton / TensorRT), and observability (Prometheus / Grafana).
  • Own evaluation : dataset curation, data labeling guidelines, ablation / A-B tests, and regression suites.
  • Collaboration

  • Work closely with domain SMEs (geotech / structural) to encode rules (KDS / KBC, internal standards) and prioritize what matters to the field.
  • Minimum Qualifications

  • 5+ years of production Python (3.x) building ML-heavy backends; strong PyTorch.
  • 3+ years in computer vision for detection / segmentation / OCR or document AI at scale.
  • Hands-on with multimodal LLM / RAG (LangChain / LlamaIndex), vector DBs (Pinecone / Weaviate / FAISS), and rerankers.
  • Proven experience parsing engineering drawings or complex PDFs (vector / raster), including geometry and layout reasoning.
  • Solid MLOps : reproducible training, CI / CD, model packaging, monitoring; cloud on AWS / GCP.
  • Fluent written & spoken English (Korean a plus).
  • Preferred Extras

  • GPU orchestration (Kubernetes / Ray / Slurm), high-performance inference (ONNX / TensorRT).
  • Experience with VLMs (Gemma-VL, Qwen-VL, LLaVA), CLIP, or doc-layout models.
  • Open-source contributions, papers, or strong public demos in CV / doc AI / RAG.
  • Full-stack chops (TypeScript / Next.js / React) for quick operator tools and review UIs.
  • Compensation & Benefits

  • Competitive base salary (market-leading) , around 20 lakh (yearly)
  • Performance-based annual bonus (up to 20%).
  • cloud credits, and AI tools support.
  • Hiring Process (≈ 2–3 weeks)

  • Quick intro call (15 min, mutual fit).
  • 48-hour take-home : Drawing Diff + Evidence-Grounded Summary (provide code + short README; clarity >
  • polish).

  • Deep-dive tech interview : architecture, modeling choices, evaluation, and scaling plan.
  • Culture & vision chat with Founder / CEO.
  • Offer — if all green, written offer within 24 h.
  • How to Apply

    Email doaz@doaz.ai with subject [CV / LLM Engineer – Your Name] and include :

  • Résumé / CV with measurable outcomes (metrics, latency, cost, accuracy).
  • Current or recent salary.
  • GitHub and / or live demos of CV / doc-AI / RAG work (links preferred).
  • A one-page diagram of your “Drawing Revision → Detection → Evidence → LLM Narrative” pipeline, noting models, retrieval, and evaluation metrics.
  • Employment type : Full-time

    Ready to turn messy drawings and specs into instant, trusted intelligence?

    Let’s build the future together at Doaz.

    Create a job alert for this search

    Senior Engineer • Thiruvananthapuram, IN