
Senior CV / LLM Engineer

doAZ • Thiruvananthapuram, IN

4 hours ago

Job description

About Doaz

Doaz is a hyper-growth startup on a mission to turn fragmented industrial knowledge into instant, actionable insight. We build LLM- and Vision-AI solutions for construction, heavy industry, and finance leaders who must transform terabytes of drawings, specifications, and regulations into real-time decisions.
We’re expanding our GeoAI programs (including joint work with POSCO E&C) and launching drawing-change detection services that automatically compare plan versions, detect deltas, and explain design impacts.

Why You’ll Love Working Here

True 0-to-1 ownership — Ship models that land in production sites within weeks.

Global impact, lean crew — 30 teammates across KR🇰🇷 / PK🇵🇰 / IN🇮🇳; no bureaucracy, only builders.

Tech freedom — YOLO or RT-DETR? Gemma-VL, Qwen-VL, or LLaVA? You choose, we fund.

Role Overview

We’re hiring a Senior Computer Vision & Multimodal LLM Engineer (GeoAI & Drawing Change Analysis).

You’ll lead end-to-end development of a version-aware drawing-diff engine (PDF / DWG raster & vector), symbol / text extraction, and change-impact narratives powered by RAG / LLM. Expect fast cycles from prototype → service: detection models, OCR / layout understanding, retrieval, and explainable outputs that engineers can trust.

Key Responsibilities

Drawing Change Analysis (CV)

Build a robust diff pipeline for architectural / structural / MEP drawings: rasterization, layer parsing, vector geometry ops, and semantic change clustering.

Train / finetune detectors & segmenters (e.g., YOLOv8 / RT-DETR / Detectron2 / SAM) for symbols (columns, openings, sleeves), title blocks, and revision clouds; achieve production-grade mAP / F1.

Implement geometry-aware post-processing (IoU / topology checks, snapping, graph connectivity) to reduce false positives.

Document & Layout Understanding

Engineer OCR + layout models (PaddleOCR / Tesseract + DocFormer / LayoutLMv3 / Donut) to read legends, notes, schedules, and BOM tables; normalize to structured JSON.

Build version-aware entity tracking (IDs, gridlines, BH IDs, coordinates) across revisions.

GeoAI & LLM / RAG

Design retrieval over drawings / specs (BM25 + vector) with reranking; ground LLM answers in evidence with citations and clickable locations.

Generate change-impact summaries (e.g., slab shear reinforcement, opening proximity to columns) with rules + LLM verification; measure factual precision.

Productization & DevOps

Ship FastAPI / gRPC microservices, batch & streaming workers (Ray / Celery), GPU inference (Triton / TensorRT), and observability (Prometheus / Grafana).

Own evaluation: dataset curation, data labeling guidelines, ablation / A-B tests, and regression suites.

Collaboration

Work closely with domain SMEs (geotech / structural) to encode rules (KDS / KBC, internal standards) and prioritize what matters to the field.

Minimum Qualifications

5+ years of production Python (3.x) building ML-heavy backends; strong PyTorch.

3+ years in computer vision for detection / segmentation / OCR or document AI at scale.

Hands-on with multimodal LLM / RAG (LangChain / LlamaIndex), vector DBs (Pinecone / Weaviate / FAISS), and rerankers.

Proven experience parsing engineering drawings or complex PDFs (vector / raster), including geometry and layout reasoning.

Solid MLOps: reproducible training, CI / CD, model packaging, monitoring; cloud on AWS / GCP.

Fluent written & spoken English (Korean a plus).

Preferred Extras

GPU orchestration (Kubernetes / Ray / Slurm), high-performance inference (ONNX / TensorRT).

Experience with VLMs (Gemma-VL, Qwen-VL, LLaVA), CLIP, or doc-layout models.

Open-source contributions, papers, or strong public demos in CV / doc AI / RAG.

Full-stack chops (TypeScript / Next.js / React) for quick operator tools and review UIs.

Compensation & Benefits

Competitive, market-leading base salary of around 20 lakh per year.

Performance-based annual bonus (up to 20%).

Cloud credits and AI tools support.

Hiring Process (≈ 2–3 weeks)

Quick intro call (15 min, mutual fit).

48-hour take-home: Drawing Diff + Evidence-Grounded Summary (provide code + short README; clarity > polish).

Deep-dive tech interview: architecture, modeling choices, evaluation, and scaling plan.

Culture & vision chat with Founder / CEO.

Offer — if all green, written offer within 24 h.

How to Apply

Email doaz@doaz.ai with subject [CV / LLM Engineer – Your Name] and include:

Résumé / CV with measurable outcomes (metrics, latency, cost, accuracy).

Current or recent salary.

GitHub and / or live demos of CV / doc-AI / RAG work (links preferred).

A one-page diagram of your “Drawing Revision → Detection → Evidence → LLM Narrative” pipeline, noting models, retrieval, and evaluation metrics.

Employment type: Full-time

Ready to turn messy drawings and specs into instant, trusted intelligence?

Let’s build the future together at Doaz.
