Talent.com
This job offer is not available in your country.
Data Engineer - Python / Spark

Data Engineer - Python / Spark

VariteHyderabad
21 days ago
Job description

About The Job :

  • Develops technical tools and programming to cleanse, organize and transform data and to maintain, protect and update data structures and integrity on an automated basis.
  • Applies data extraction, transformation, and loading techniques in order to tie together large data sets from a variety of sources.
  • Partners with both internal & external sources to design, build and oversee the deployment and operation of technology architecture, solutions and software.
  • Designs, develops and programs methods, processes and systems to capture, manage, store and utilize structured and unstructured data to generate actionable insights and solutions.
  • Responsible for the maintenance, improvement, cleaning, and manipulation of data in the business client's operational and analytics databases.
  • Proactively analyzes and evaluates the business client's databases in order to identify and recommend improvements and optimization.

Essential Job Functions :

  • Uses knowledge of existing and emerging data science engineering principles, theories, and techniques to inform business decisions; and produce accurate business insights.
  • Completes projects and assignments of moderate scope and complexity under normal supervision to ensure customer and business needs are met.
  • Applies discretion and independent judgement to interpret data trends and summarize data insights.
  • Assists in the preliminary data exploration, data preparation for accurate model development.
  • Establishes working relationships with others outside area of Data Science Engineering expertise.
  • Prepares presentations of project outputs for external customers with assistance.
  • Design, develop, and maintain scalable data pipelines and systems for data processing.
  • Utilize Data Lakehouse, Spark on Kubernetes and related technologies to manage large-scale data processing.
  • Perform data ingestion from various sources like API's, RDBMS, NoSQL DB's, Kafka, Middleware & Files using Spark and process data into Lakehouse platform.
  • Develop and maintain py-spark scripts for automation of data processing tasks.
  • Implement full and incremental data loading strategies to ensure data consistency and availability.
  • Orchestrate and monitor workflows using Apache Airflow.
  • Ensure code quality and version control using GIT.
  • Troubleshoot and resolve data-related issues in a timely manner.
  • Stay up-to-date with the latest industry trends and technologies to continuously improve our data infrastructure.
  • Design, develop, and maintain scalable data pipelines and systems for data processing.
  • Utilize Data Lakehouse, Spark on Kubernetes and related technologies to manage large-scale data processing.
  • Perform data ingestion from various sources like API's, RDBMS, NoSQL DB's, Kafka, Middleware &
  • Files using Spark and process data into Lakehouse platform.
  • Develop and maintain py-spark scripts for automation of data processing tasks.
  • Implement full and incremental data loading strategies to ensure data consistency and availability.
  • Orchestrate and monitor workflows using Apache Airflow.
  • Ensure code quality and version control using GIT.
  • Troubleshoot and resolve data-related issues in a timely manner.
  • Stay up-to-date with the latest industry trends and technologies to continuously improve our data :
  • Proven experience as a Data Engineer (ETL, data warehousing, data Lakehouse).
  • Strong knowledge of Spark on Kubernetes, S3 and Docker Images.
  • Proficiency in Data engineering techniques with Py-spark.
  • Strong experience in Data warehousing techniques like data mining, data analysis, data profiling.
  • Experience with Python scripting for automation.
  • Expertise in full and incremental data loading techniques.
  • Excellent problem-solving skills and attention to detail.
  • Ability to work collaboratively in a team environment and communicate effectively with stakeholders.
  • Good to have :

  • Understanding of streaming data applications using.
  • Hands-on experience with Apache Airflow for workflow orchestration.
  • Proficiency with GIT for version control
  • Understanding of data engineering integration with LLMs or GEN-AI applications and Vector DB.
  • Knowledge on Shell scripting Postgres SQL or SQL server or MSBI.
  • (ref : hirist.tech)

    Create a job alert for this search

    Data Engineer • Hyderabad

    Related jobs
    • Promoted
    Data Engineer

    Data Engineer

    Innodata Inc.Hyderabad, IN
    CI / CD practices, Databricks (Spark), Python, Github and SQL.The ideal candidate should have hands-on expertise in building and automating data pipelines, managing multi-environment deployments, and...Show moreLast updated: 25 days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    StraiveHyderabad, Telangana, India
    The ideal candidate is a strong software engineer with hands-on experience in Spark (3.You'll be responsible for designing and implementing ETL / ELT solutions, collaborating with teams to deliver da...Show moreLast updated: 3 days ago
    • Promoted
    Big Data Engineer - Python / PySpark

    Big Data Engineer - Python / PySpark

    People Prime World WideHyderabad
    Key Responsibilities : - Design, develop, and maintain scalable ETL / ELT pipelines using Python and PySpark.Work with large-scale datasets across di...Show moreLast updated: 10 days ago
    • Promoted
    Cloud Data Engineer - PySpark / Python

    Cloud Data Engineer - PySpark / Python

    NPG ConsultantsHyderabad
    Cloud Data Engineer : Build scalable, cloud-native data solutions using 5+ years of hands-on experience.Transform complex datasets into reliable ins...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer

    Senior Data Engineer

    DeltacubesHyderabad, IN
    Build and maintain scalable ETL / ELT pipelines.Work with Snowflake and BigQuery for data storage.Implement orchestration with Airflow or Prefect. Integrate data workflows with Python.Optimize data pi...Show moreLast updated: 15 days ago
    • Promoted
    Data Engineer

    Data Engineer

    ACL Digitalsecunderabad, telangana, in
    Design, develop, and optimize Spark-based data pipelines on Databricks for large-scale data processing.Design, develop, and optimize AWS pipeline as applicable. Implement and manage GitHub asset bun...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Bahwan CyberTeksecunderabad, telangana, in
    Job Title : Data Engineer – Google Cloud Platform (GCP).We are seeking a skilled and motivated Data Engineer with hands-on experience in building scalable data pipelines and cloud-native data soluti...Show moreLast updated: 17 days ago
    • Promoted
    Data Engineer - Python / PySpark

    Data Engineer - Python / PySpark

    MEEDEN LABS PRIVATE LIMITEDHyderabad
    Job Summary : We are seeking a skilled Data Engineer with strong expertise in Python and PySpark to design, develop, and optimize large-scale data ...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    ADPhyderabad, telangana, in
    Below is the JD followed for Data Engineering.Position : Data Engineer with 4 to Y8ears experience in AWS, PySpark, Python, SQL and Data bricks. In this role candidate will be responsible for unders...Show moreLast updated: 26 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Rrootshell Technologiiss Pvt Ltdhyderabad, telangana, in
    Hope you are doing well & Safe!.Rrootshell Technologiiss Pvt Ltd.We are HIRING & URGENT Requirement for.This is for FULL -TIME role and Work From Office (WFO) Opportunity. Location : Hyderabad, Work ...Show moreLast updated: 20 days ago
    • Promoted
    Data Engineer

    Data Engineer

    Egenhyderabad, telangana, in
    Lead Data Engineer – Python & GCP.We are looking for a skilled and motivated Lead Data Engineer with strong experience in Python programming and Google Cloud Platform (GCP) to join our data enginee...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    Saras Analyticshyderabad, telangana, in
    As a Data Engineer, you will be responsible for designing, building, and maintaining robust data pipelines and workflows. You will work closely with data analysts, data scientists, and other stakeho...Show moreLast updated: 7 days ago
    • Promoted
    Senior Data Engineer - Python / Spark

    Senior Data Engineer - Python / Spark

    HIMFLAX INFORMATION TECHNOLOGIES PRIVATE LIMITEDHyderabad
    Key Responsibilities : Data Architecture & Pipeline Development : - Design, implement, and mainta...Show moreLast updated: 30+ days ago
    • Promoted
    Senior Data Engineer - AWS / Python

    Senior Data Engineer - AWS / Python

    Servhigh Global Services Private LimitedHyderabad
    Job Description : We are looking for a highly skilled Senior Data Engineer with strong expertise in AWS, Python, and PySpark to join our team.Key Responsibilities : &l...Show moreLast updated: 27 days ago
    • Promoted
    • New!
    Data Engineer

    Data Engineer

    TechVeritohyderabad, telangana, in
    You will play a critical role in designing, building, and optimizing data workflows that enable scalable analytics and real-time insights. The ideal candidate is hands-on, detail-oriented, and passi...Show moreLast updated: 6 hours ago
    • Promoted
    Data Engineer III - Python / PySpark

    Data Engineer III - Python / PySpark

    HyrEzy Talent SolutionsHyderabad
    About Company : We are a next-generation AI and Cloud Transformation company driving innovation at the intersection of technology and bus...Show moreLast updated: 21 days ago
    • Promoted
    Azure Data Engineer - Python / PySpark

    Azure Data Engineer - Python / PySpark

    GALAXY I TECHNOLOGIES INCHyderabad
    Role Description : We are seeking a highly skilled Azure Data Engineer to join our team.The ideal candidate will be responsible for designing, building, and maintaini...Show moreLast updated: 30+ days ago
    • Promoted
    Data Engineer

    Data Engineer

    INFEC ServicesHyderabad, Telangana, India
    Design, develop, and optimize data pipelines and ETL processes on GCP or Azure.Work with structured and unstructured data, integrating sources such as databases, APIs, and streaming platforms.Imple...Show moreLast updated: 30+ days ago