We at STGYA part of STI Aromas a Hong Kong based company are seeking a highly motivated and skilled AI / ML Engineer with a strong focus on training and fine-tuning Large Language Models (LLMs), Vision Language Models (VLMs), and Small Language Models (SLMs).
The ideal candidate will possess a deep understanding of deep learning principles, experience with state-of-the-art model architectures, and a proven track record of developing and deploying high-performance AI models. You will play a crucial role in advancing our AI capabilities and contributing to the development of innovative AI-powered products and services.
Responsibilities :
Model Training and Fine-tuning : Design, implement, and optimize training pipelines for LLMs, VLMs, and SLMs using large-scale datasets.
- Experiment with various training techniques, including transfer learning, reinforcement learning, and parameter-efficient fine-tuning (PEFT).
- Evaluate and improve model performance using relevant metrics and benchmarks.
- Address model biases and ensure fairness and robustness.
Model Architecture and Development : Research and implement cutting-edge model architectures and techniques for LLMs, VLMs, and SLMs.
Adapt and customize existing models to meet specific application requirements.Develop and maintain efficient code for model training and inference.Data Management and Processing : Work with large-scale datasets, including text, images, and multimodal data.
Develop data preprocessing and augmentation pipelines to improve model performance.Collaborate with data engineers to ensure data quality and availability.Infrastructure and Deployment : Optimize model training and inference for performance and scalability.
Deploy trained models to production environments.Monitor and maintain deployed models.Work with cloud based infrastructure such as AWS, GCP, or Azure.Research and Development : Stay up-to-date with the latest advancements in AI / ML, particularly in LLMs, VLMs, and SLMs.
Contribute to research projects and publications.Collaborate with other researchers and engineers to develop innovative AI solutions.Collaboration and Communication : Work closely with cross-functional teams, including product managers, data scientists, and software engineers.
Communicate technical concepts and findings effectively to both technical and non-technical audiences.Document all work clearly and effectively.Qualifications :
Technical Skills : Strong understanding of deep learning principles and techniques.Extensive experience with training and fine-tuning LLMs, VLMs, and SLMs.Proficiency in deep learning frameworks such as TensorFlow, PyTorch, or JAX.Experience with transformer-based architectures (e.g., BERT, GPT, T5, ViT).Experience with cloud computing platforms (e.g., AWS, GCP, Azure).Proficiency in Python and other relevant programming languages.Experience with version control systems (e.g., Git).Preferred Skills : Experience with distributed training and large-scale model deployment.
Knowledge of natural language processing (NLP), computer vision, and multimodal learning.Experience with reinforcement learning.Experience with prompt engineering.Experience with model quantization and pruning.Soft Skills :
Strong problem-solving and analytical skills.Excellent communication and collaboration skills.Ability to work independently and as part of a team.Strong passion for AI and machine learning.Ability to adapt to fast changing technologies.