Job Title : Founding AI Engineer – Computer / Browser Use Systems
Location : Gurugram (onsite)
Experience : 3+ years
Type : Full-time | Start : Immediate
Comp : Cash + 1.5-4.5%
About Ourguide AI
We’re building an AI desktop app that can see your screen and take the next step for you—a true computer-use copilot. You’ll work directly with the founders (IIT / Purdue) and own core pieces of the AI engine.
What You’ll Do
- Design multimodal “computer-use” agents that use screenshots + text to decide the next action.
- Implement tool-calling agents that trigger APIs, clicks, typing, and workflows reliably.
- Build end-to-end LLM / VLM pipelines : data prep, training / fine-tuning, and low-latency inference.
- Integrate AI with desktop / browser automation (e.g., Playwright / OS-level control).
- Own logging, evaluation, and observability for agent runs and user flows.
Must-Have Skills
- Strong coding in Python and comfort with production systems.
- Experience with LLMs & agents : OpenAI / Bedrock / Anthropic APIs, LangChain / LlamaIndex or custom.
- Solid understanding of fine-tuning, RAG, and prompt design.
- Hands-on with vision / multimodal models (e.g., YOLO or similar detection models, or GPT-4V / Claude VLM).
- Experience with vector DBs (Pinecone, Qdrant, PGVector, etc.) and Postgres or similar.
- Comfortable with AWS / cloud, Docker, Linux.
Nice to Have
- Open-source contributions, serious side projects, or demos in LLMs / agents / vision.
- Prior startup experience or clear ownership mindset.
If you’ve built real systems, not just notebooks, and want to work on agents that actually use a computer, APPLY HERE.
Contact (WhatsApp) :
Eshaan Gulati, 1 617 721 7143