Talent.com
Data Engineer - Python/Spark
Data Engineer - Python/SparkVarite • Hyderabad
Data Engineer - Python / Spark

Data Engineer - Python / Spark

Varite • Hyderabad
30+ days ago
Job description

About The Job :

  • Develops technical tools and programming to cleanse, organize and transform data and to maintain, protect and update data structures and integrity on an automated basis.
  • Applies data extraction, transformation, and loading techniques in order to tie together large data sets from a variety of sources.
  • Partners with both internal & external sources to design, build and oversee the deployment and operation of technology architecture, solutions and software.
  • Designs, develops and programs methods, processes and systems to capture, manage, store and utilize structured and unstructured data to generate actionable insights and solutions.
  • Responsible for the maintenance, improvement, cleaning, and manipulation of data in the business client's operational and analytics databases.
  • Proactively analyzes and evaluates the business client's databases in order to identify and recommend improvements and optimization.

Essential Job Functions :

  • Uses knowledge of existing and emerging data science engineering principles, theories, and techniques to inform business decisions; and produce accurate business insights.
  • Completes projects and assignments of moderate scope and complexity under normal supervision to ensure customer and business needs are met.
  • Applies discretion and independent judgement to interpret data trends and summarize data insights.
  • Assists in the preliminary data exploration, data preparation for accurate model development.
  • Establishes working relationships with others outside area of Data Science Engineering expertise.
  • Prepares presentations of project outputs for external customers with assistance.
  • Design, develop, and maintain scalable data pipelines and systems for data processing.
  • Utilize Data Lakehouse, Spark on Kubernetes and related technologies to manage large-scale data processing.
  • Perform data ingestion from various sources like API's, RDBMS, NoSQL DB's, Kafka, Middleware & Files using Spark and process data into Lakehouse platform.
  • Develop and maintain py-spark scripts for automation of data processing tasks.
  • Implement full and incremental data loading strategies to ensure data consistency and availability.
  • Orchestrate and monitor workflows using Apache Airflow.
  • Ensure code quality and version control using GIT.
  • Troubleshoot and resolve data-related issues in a timely manner.
  • Stay up-to-date with the latest industry trends and technologies to continuously improve our data infrastructure.
  • Design, develop, and maintain scalable data pipelines and systems for data processing.
  • Utilize Data Lakehouse, Spark on Kubernetes and related technologies to manage large-scale data processing.
  • Perform data ingestion from various sources like API's, RDBMS, NoSQL DB's, Kafka, Middleware &
  • Files using Spark and process data into Lakehouse platform.
  • Develop and maintain py-spark scripts for automation of data processing tasks.
  • Implement full and incremental data loading strategies to ensure data consistency and availability.
  • Orchestrate and monitor workflows using Apache Airflow.
  • Ensure code quality and version control using GIT.
  • Troubleshoot and resolve data-related issues in a timely manner.
  • Stay up-to-date with the latest industry trends and technologies to continuously improve our data :
  • Proven experience as a Data Engineer (ETL, data warehousing, data Lakehouse).
  • Strong knowledge of Spark on Kubernetes, S3 and Docker Images.
  • Proficiency in Data engineering techniques with Py-spark.
  • Strong experience in Data warehousing techniques like data mining, data analysis, data profiling.
  • Experience with Python scripting for automation.
  • Expertise in full and incremental data loading techniques.
  • Excellent problem-solving skills and attention to detail.
  • Ability to work collaboratively in a team environment and communicate effectively with stakeholders.
  • Good to have :

  • Understanding of streaming data applications using.
  • Hands-on experience with Apache Airflow for workflow orchestration.
  • Proficiency with GIT for version control
  • Understanding of data engineering integration with LLMs or GEN-AI applications and Vector DB.
  • Knowledge on Shell scripting Postgres SQL or SQL server or MSBI.
  • (ref : hirist.tech)

    Create a job alert for this search

    Data Engineer • Hyderabad

    Related jobs
    Senior Data Engineer - AWS & Python

    Senior Data Engineer - AWS & Python

    Egen • hyderabad, telangana, in
    Design, develop, and maintain ETL / ELT data pipelines using Python and AWS native services (Glue, Lambda, EMR, Step Functions, etc. Build and manage data lakes and data warehouses using Amazon S3, Re...Show more
    Last updated: 7 days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Straive • Hyderabad, Telangana, India
    The ideal candidate is a strong software engineer with hands-on experience in Spark (3.You'll be responsible for designing and implementing ETL / ELT solutions, collaborating with teams to deliver da...Show more
    Last updated: 30+ days ago • Promoted
    Lead Python Data Engineer _ Exp : 7+ Years

    Lead Python Data Engineer _ Exp : 7+ Years

    Atyeti Inc • hyderabad, telangana, in
    Bachelor’s or Master’s degree in Computer Science or equivalent experience.Application Developer or in similar software engineering roles. Python; strong SQL and cloud-native development experience ...Show more
    Last updated: 13 days ago • Promoted
    Data Engineer - Python / Apache Spark

    Data Engineer - Python / Apache Spark

    BOHIYAANAM TECHNOLOGY LLP • Hyderabad
    Description : Key Responsibilities : - Design, develop, and deploy end-to-end da...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    IntraEdge • Hyderabad, IN
    Python, PySpark, AWS services (Glue, Lambda), and Snowflake.The ideal candidate will design, build, and maintain scalable data pipelines, ensure efficient data integration, and enable advanced anal...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer - Snowflake

    Data Engineer - Snowflake

    Prudent Technologies and Consulting, Inc. • hyderabad, telangana, in
    We are seeking a skilled Data Engineer with strong experience in Python, Snowflake, and AWS.The ideal candidate will be responsible for building and optimizing scalable data pipelines, integrating ...Show more
    Last updated: 9 days ago • Promoted
    Data Engineer (AWS / PySpark)

    Data Engineer (AWS / PySpark)

    Tata Consultancy Services • Hyderabad, Republic Of India, IN
    TCS is Hiring Data Engineer+AWS Pyspark For Hyderabad, Bangalore, Chennai location.AWS Pyspark / Python / Hive -Primary.AWS Glue, Lambda, Athena-Secondary. Good hands-on experience in Python programmin...Show more
    Last updated: 3 days ago • Promoted
    AWS Data Engineer - Python / PySpark

    AWS Data Engineer - Python / PySpark

    Deqode • Hyderabad
    Key Responsibilities : - Design, develop, and maintain scalable data pipelines and architectures using AWS services.Implement ETL / ELT processes us...Show more
    Last updated: 30+ days ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    People Prime Worldwide • Hyderabad, India
    Design, develop, test, and maintain scalable ETL data pipelines using Python.Work extensively on Google Cloud Platform (GCP) services such as : . Dataflow for real-time and batch data processing.Cloud...Show more
    Last updated: 30+ days ago • Promoted
    Senior Python Data Engineer

    Senior Python Data Engineer

    iVoyant • secunderabad, telangana, in
    Join a dynamic engineering team working on a high-impact tax reporting platform for the 2025 fiscal season.The core goal is to modernize and significantly accelerate the generation of Excel-based r...Show more
    Last updated: 9 days ago • Promoted
    PySpark Data Engineer

    PySpark Data Engineer

    EXTRAGIG • Hyderabad, IN
    Contract Assistant – Data Engineer Support (Remote, EST Hours).PySpark Data Engineer with daily activities.This is a remote contract role. Execute creative software and data solutions, including des...Show more
    Last updated: 30+ days ago • Promoted
    Lead Data Pipeline Engineer

    Lead Data Pipeline Engineer

    Straive • Hyderabad, Republic Of India, IN
    The ideal candidate is a strong software engineer with hands-on experience in Spark (3.You'll be responsible for designing and implementing ETL / ELT solutions, collaborating with teams to deliver da...Show more
    Last updated: 30+ days ago • Promoted
    Data Solutions Engineer

    Data Solutions Engineer

    Straive • Hyderabad, Republic Of India, IN
    Design, build and maintain scalable.Implement core ETL / ELT logic in Scala and Python;.Write and optimize complex SQL for ingestion, transformation and consumption layers. Tune Spark jobs for perform...Show more
    Last updated: 9 days ago • Promoted
    Data Engineer

    Data Engineer

    Straive • Hyderabad, Telangana, India
    Design, build and maintain scalable.Implement core ETL / ELT logic in Scala and Python; author efficient Spark DataFrame / Dataset jobs. Write and optimize complex SQL for ingestion, transformation and ...Show more
    Last updated: 9 days ago • Promoted
    Data Engineer

    Data Engineer

    Aceolution • secunderabad, telangana, in
    Data Engineer – Python Expert(Freelance Role).We are looking for a seasoned Senior Data Engineer to architect, build, and own the data pipelines that power our large language model (LLM) developmen...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer III - Python / PySpark

    Data Engineer III - Python / PySpark

    HyrEzy Talent Solutions • Hyderabad
    About Company : We are a next-generation AI and Cloud Transformation company driving innovation at the intersection of technology and bus...Show more
    Last updated: 30+ days ago • Promoted
    EverestDX - Senior Data Engineer - Python / PySpark

    EverestDX - Senior Data Engineer - Python / PySpark

    EverestDX • Hyderabad
    Description : Job Summary : - We are seeking an experienced Azure Data Engineer / Lead with strong expertise in Azure servic...Show more
    Last updated: 7 days ago • Promoted
    Senior Python Data Engineer

    Senior Python Data Engineer

    SIRO • secunderabad, telangana, in
    Good Python Programming with 6 years of experience; PyCharm; Well versed in AWS tools , will be good if AWS architect certified. GitHub; GitAction; Experience in deploying Data Pipelines.Good commu...Show more
    Last updated: 14 days ago • Promoted