The CV + Edge AI Engineer develops AI solutions for visual understanding, document parsing, and multimodal processing. This role enables new use cases beyond text by integrating image, OCR, and edge-deployable AI capabilities.
Key Duties / Responsibilities
- Develop OCR and image processing pipelines using OpenCV, Tesseract, or AWS Textract.
- Train and fine-tune visual models (e.g., YOLOv8, SAM, CLIP) for internal PoCs.
- Integrate visual AI modules into end-to-end workflows used by LLMs or agents.
- Optimize models for edge deployments using ONNX, TensorRT, or TFLite.
- Collaborate with backend and AI teams for data structure alignment.
Leadership Skills :
Self-driven in visual AI exploration.Effective in prototyping and cross-functional collaboration.Ability to demonstrate impact through PoCs.Required Technical Skills :
OpenCV, YOLOv8, SAM, BLIP2, Tesseract.ONNX, TensorRT, AWS Panorama, Jetson Nano.Python, PyTorch / TensorFlow, edge deployment toolchains.Experience with one or more visual AI stacks such as YOLOv8, SAM, CLIP, or equivalent.Capability to structure visual outputs for downstream agent or LLM processing.Qualification :
Bachelor s or Masters degree in Computer Science, AI / ML, Data Science, or related fields.Skills Required
Python, Opencv, SAM, TesserAct