LLM Trainer / Python Developer
1 Month Contract with Extension
100% Remote
Working hours for the roles below are 40 hours per week, with a required 4-hour overlap with PST (6 : 00 AM – 10 : 00 AM). The remaining hours are flexible.
Role Overview
We are seeking a Python Engineer to validate and ensure the quality of training data used in developing agentic AI systems. This role involves working with Python and JSON function-calling methods to create structured tasks, ensuring data workflows remain consistent, accurate, and aligned with cutting-edge AI requirements.
What does day-to-day look like?
Validate large-scale training datasets and maintain strict quality standards.
Develop and optimize Python-based pipelines for data processing and validation.
Create, manipulate, and structure JSON tasks for function-calling workflows.
Work closely with AI researchers and teams to align datasets with agentic AI use cases.
Monitor data accuracy, consistency, and integrity across projects.
Contribute to documentation and best practices for dataset preparation and validation.
Requirements
JSON AND (Javasctipt OR Python OR Typescript OR C#)
Strong proficiency in Python or JavaScript or Java or any other programming languages for data handling and workflow automation.
Solid understanding of SQL for querying and managing datasets.
Hands-on experience with GitHub for version control and collaboration.
Expertise in handling, creating, and validating JSON - formatted tasks.
Exceptional attention to detail with a focus on data quality and consistency.
Interest in or exposure to agentic AI, LLMs, or NLP best practices.
Bonus : Prior experience with data annotation tools or prompt engineering for Agent AI.
Work in a fully remote environment.
Opportunity to work on cutting-edge AI projects with leading LLM companies.
Python Developer • Surat, IN