Required Skills
3+ years of experience in machine learning, deep learning, or AI research, with a focus on generative models.
Experience with generative models such as GANs (Generative Adversarial Networks), VAEs (Variational Autoencoders), and transformer-based models (e.g., GPT-3/4, BERT, DALL-E), as well as retrieval-augmented generation (RAG) and prompt engineering.
Understanding of model fine-tuning, transfer learning, and prompt engineering in the context of large language models (LLMs).
Strong programming skills in Python and familiarity with relevant libraries and frameworks.
Hands-on experience in building data applications using Streamlit or similar tools
Deep understanding of transformer architecture, including multi-head attention, layer normalization, and residual connections
Proficiency with Hugging Face Transformers, PyTorch or TensorFlow
Experience in training and deploying LLMs for tasks like text generation, summarization, or translation
Familiarity with AdamW, learning rate warm-up, and related optimization strategies
Exposure to advanced transformer variants (e.g., Transformer-XL, Longformer, BigBird) and parameter-efficient fine-tuning methods such as LoRA
Solid debugging and performance tuning skills
Good-to-Have
Experience with Docker, Kubernetes, and MLOps
Knowledge of CI/CD pipelines and model deployment best practices
Gen AI Engineer • Vizianagaram, Andhra Pradesh, India