We are seeking talented individuals who have recently completed their PhD to join our cutting-edge team. This position is specifically designed for fresh PhD graduates looking to apply their research expertise in a dynamic industry setting. In this role, you'll tackle complex challenges in large language models (LLMs), optical character recognition (OCR), and model scaling. You'll be at the forefront of developing and optimizing AI systems that push the boundaries of what's possible in machine learning.
Responsibilities :
- Lead research initiatives to improve OCR accuracy across diverse document types and languages.
- Train and fine-tune LLMs using domain-specific data to enhance performance in specialized contexts.
- Develop techniques to scale LLMs efficiently for high-volume production environments.
- Design and implement novel approaches to model optimization and evaluation.
- Collaborate with cross-functional teams to integrate AI solutions into production systems.
- Stay current with the latest research and incorporate state-of-the-art techniques.
- Document methodologies, experiments, and findings for both technical and non-technical audiences.
Requirements :
PhD or Master's in Computer Science, Machine Learning, AI, or a related field.Strong understanding of deep learning architectures, particularly transformer-based models.Experience with OCR systems and techniques for improving text recognition accuracy.Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, or JAX).Demonstrated ability to implement and adapt research papers into working code.Excellent problem-solving skills with a methodical approach to experimentationStrong communication skills to explain complex technical concepts clearly.Preferred Qualifications :
Research focus during PhD in areas relevant to our work (NLP, computer vision, multimodal learning).Familiarity with distributed training systems for large-scale models.Experience with model quantization, pruning, and other efficiency techniques.Understanding of evaluation methodologies for assessing model performance.Knowledge of MLOps practices and tools for model deployment.Publications at top-tier ML conferences (NeurIPS, ICML, ACL, CVPR, etc.).(ref : hirist.tech)