Talent.com
This job offer is not available in your country.
Python Data Engineer

Python Data Engineer

Delphie Consulting servicesHyderabad
29 days ago
Job description

Role Overview :

We are looking for an experienced Python Data Engineer with strong expertise in PySpark, distributed data engineering, and LLM integration.

The role involves building scalable data pipelines, AI-powered workflows, and enabling data-driven automation using platforms such as Databricks, AWS EMR, and LangChain.

As an SE3 engineer, you will primarily be responsible for hands-on development and delivery, while also contributing to solution design and collaborating with cross-functional teams.

Key Responsibilities :

  • Develop and optimize ETL pipelines using PySpark, SparkSQL, and distributed frameworks.
  • Work with LangChain to integrate LLM-based solutions into data workflows (Agents, Toolkits, Vector Stores).
  • Implement data transformations, lineage, and governance controls in data platforms.
  • Support ML teams in deploying embeddings, retrieval-augmented generation (RAG), and NLP pipelines.
  • Build workflows on Databricks and AWS EMR, ensuring cost and performance efficiency.
  • Apply best practices in coding, testing, CI / CD, and Skills :
  • Strong proficiency in Python for data engineering.
  • Hands-on experience in PySpark, SparkSQL, and ETL design.
  • Working knowledge of LangChain, OpenAI APIs, or Hugging Face.
  • Experience with Databricks, AWS EMR, or similar cloud data platforms.
  • Good understanding of SQL, data modeling, and distributed data Skills :
  • Familiarity with Google ADK Prompt Engineering.
  • Experience with vector databases like FAISS, Pinecone, or Chroma.
  • Exposure to MLflow, Unity Catalog, or SageMaker.
  • Interest in LLM-powered applications and generative AI workflows

(ref : hirist.tech)

Create a job alert for this search

Data Engineer Python • Hyderabad