Job Summary :
We are seeking a highly skilled and motivated Data Scientist to join our Data & AI team. The ideal candidate has strong expertise in data analytics, machine learning, and generative AI, with hands-on experience building and operationalizing ETL pipelines, LLM-based applications, and agentic AI workflows.
Key Responsibilities
1. Data Engineering & ETL Development :
- Design, build, and maintain end-to-end ETL pipelines for data ingestion, transformation, and storage using tools like Azure Data Factory, Databricks, PySpark, and SQL.
- Work with structured and unstructured data from diverse sources including APIs, data lakes, and streaming platforms (e.g., Kafka, Event Hubs).
2. AI, LLM, and Machine Learning Development :
- Design and implement machine learning models for classification, regression, clustering, NLP, and recommendation systems using Python, scikit-learn, TensorFlow, or PyTorch.
- Build agentic AI workflows using frameworks such as LangChain and AutoGen to orchestrate autonomous reasoning, decision-making, and task automation.
- Integrate retrieval-augmented generation (RAG) pipelines using vector databases like FAISS, Pinecone, or Azure Cognitive Search for knowledge-grounded AI systems.
- Utilize Azure AI Services, including Azure OpenAI, Cognitive Services, and Azure Machine Learning, to build and deploy LLM-powered solutions.
3. Azure Cloud Platform Experience :
- Strong hands-on experience with the Microsoft Azure ecosystem, including Azure Data Factory (ADF), Azure Databricks, Azure Synapse Analytics, Azure OpenAI Service, Azure Cognitive Services, Azure Blob Storage, and Azure Key Vault.
4. Client & Stakeholder Collaboration :
- Partner with business and technical stakeholders to gather requirements, define KPIs, and develop data-driven solutions aligned with client goals.
- Present findings, visualizations, and model outputs clearly to both technical and non-technical audiences.
- Support solution design discussions for data and AI-driven projects.
Note : Candidates with Azure certifications will be given preference.