We’re a fast-growing startup working on a next-generation conversational platform that blends natural voice, real-time interaction, and emotional intelligence. The product is already live in private beta — we’re looking for an AI Engineer who can refine, optimize, and scale our current model integrations.
Responsibilities
- Build, fine-tune, and deploy LLM-based conversational agents using OpenAI, Gemini, or similar APIs
- Optimize real-time audio pipelines , including transcription (STT) and speech synthesis (TTS)
- Develop and test prompt-engineering logic and dynamic response generation
- Collaborate with backend engineers to improve latency, accuracy, and contextual memory
- Implement custom tools and APIs for real-time reasoning and emotion detection
- Monitor and analyze model performance, create feedback loops for continuous improvement
Requirements
Proven experience integrating OpenAI, Anthropic, Gemini, or similar LLMsStrong coding skills in Python / Node.jsUnderstanding of NLP, embeddings, prompt design, and text generation pipelinesHands-on experience with real-time systems (WebSocket, LiveKit, Twilio, etc.)Ability to debug and optimize model response times and context flowSolid foundation in API integration, cloud deployment, and model testingNice to Have
Experience with emotion recognition , voice-to-voice interfaces , or persona-based AIFamiliarity with LangChain , RAG , or vector databasesExperience in fine-tuning smaller open-source models (LLaMA, Mistral, etc.)Previous work on AI companions , counseling bots , or real-time agentsWhat We Offer
Competitive pay (monthly or milestone-based)Opportunity to work on cutting-edge real-time AI productsFast, creative environment — direct impact on user experienceLong-term potential for growth with a core founding team⚠️ Note : This project is currently in private beta. Full product details will be shared under NDA with shortlisted candidates.