Talent.com
Principal Data Engineer
Principal Data EngineerExigo Tech • Vadodara, Gujarat, India
No longer accepting applications
Principal Data Engineer

Principal Data Engineer

Exigo Tech • Vadodara, Gujarat, India
1 day ago
Job description
  • Exigo Tech is a Sydney-based Technology Solutions Provider that is focused on providing solutions on three major verticals;
  • Infrastructure, Cloud, and Application to businesses across Australia. We help companies reach operational efficiencies by empowering them with technology solutions that drive their business processes.

    Exigo is looking for Full-time Sr. Data Engineer

    We are ISO 27001 : 2022 certified organization

    Visit our website : for more details….

    LinkedIn :

    Click Here to know more : LIFE AT EXIGO TECH

    Roles and Responsibilities

    Install, configure, and manage Apache Spark (open-source) clusters on Ubuntu, including Spark master / worker nodes and Spark environment files.

    Configure and manage Spark UI and Spark History Server for monitoring jobs, analyzing DAGs, stages, tasks, and troubleshooting performance.

    Develop, optimize, and deploy PySpark ETL / ELT pipelines using DataFrame API, UDFs, window functions, caching, partitioning, and broadcasting.

    Deploy PySpark jobs using spark-submit in client / cluster mode with proper logging and error handling.

    Install, configure, and manage Apache Airflow including UI, scheduler, webserver, connections, and variables.

    Create, schedule, and monitor Airflow DAGs for PySpark jobs using SparkSubmitOperator, BashOperator, or PythonOperator.

    Configure and manage cron jobs for scheduling data processing tasks where needed.

    • Install, configure, and optimize Trino (PrestoSQL) coordinator and worker nodes;
    • configure catalogs suchas S3, MySQL, or PostgreSQL.

      Maintain Linux / Ubuntu servers including services, logs, environment variables, memory usage, and port conflict resolution.

      Design and implement scalable data architectures using Azure Data Services including ADF, Synapse, ADLS, Azure SQL, and Databricks.

      Develop, manage, and automate ETL / ELT pipelines using Azure Data Factory (Pipelines, Mapping Dataflows, Dataflows).

      Monitor, troubleshoot, and optimize data pipelines across Spark, Airflow, Trino, and Azure platforms.

      Work with structured, semi-structured, and unstructured data across multiple data sources and formats.

      Implement data analytics, transformation, backup, and recovery solutions.

      Perform data migration, upgrade, and modernization using Azure and database tools.

      Implement CI / CD pipelines for data solutions using Azure DevOps and Git.

      Ensure data quality, governance, lineage, metadata management, and security compliance across cloud and big data environments.

    • Design and optimize data models using star and snowflake schemas;
    • build data warehouses, Delta Lake, and Lakehouse systems.

      Develop and rebuild reports / dashboards using Power BI, Tableau, or similar tools.

      Collaborate with internal teams, clients, and business users to gather requirements and deliver high-quality data solutions.

      Provide documentation, runbooks, and operational guidance.

      Technical Skills :

      Apache Spark (Open Source) & PySpark - Must

      Apache Spark installation & cluster configuration (Ubuntu / Linux)

      Spark master / worker setup (standalone & cluster mode)

      Spark UI & History Server configuration and debugging

      PySpark development (ETL pipelines, UDFs, window functions, DataFrame API)

      Performance tuning (partitioning, caching, shuffles)

      spark-submit deployment with monitoring and logging

      2. Apache Airflow & Job Orchestration - Must

      Airflow installation & configuration (UI, scheduler, webserver)

      Creating and scheduling DAGs (SparkSubmitOperator, BashOperator, PythonOperator)

      Retry logic, triggers, alerting, and log management

      Cron job scheduling & process automation

      3. Trino (PrestoSQL) - Must

      Trino coordinator & worker node setup

      Catalog configuration (S3, RDBMS sources)

      Distributed SQL troubleshooting & performance optimization

      4. Azure Data Services (nice to have)

      Azure Data Factory

      Azure Synapse Analytics

      Azure SQL / Cosmos DB

      Azure Data Lake Storage (Gen2)

      Azure Databricks (Delta, Notebooks, Jobs)

      Azure Event Hubs / Stream Analytics

      5. Microsoft Fabric ( nice to have)

      Lakehouse

      Warehouse

      Dataflows

      Notebooks

      Pipelines

      6. Programming & Querying

      Python

      PySpark

      SQL

      Scala

      7. Data Modeling & Warehousing

      Star schema modeling

      Snowflake schema modeling

      Fact / dimension modeling

      Data warehouse & Lakehouse design

      Delta Lake / Lakehouse architectures

      8. DevOps & CI / CD

      Git / GitHub / Azure Repos

      Azure DevOps pipelines (CI / CD)

      Automated deployment for Spark, Airflow, ADF, Databricks, Fabric

      9. BI Tools (Nice to have)

      Power BI

      Tableau

      Report building, datasets, DAX

      10. Linux / Ubuntu Server Knowledge

      Shell scripting

      Service management

      Logs & environment variables

      Soft Skills :

      Excellent problem solving and communication skills

      Able to work well in a team setting

      Excellent organizational and time management skills

      Taking end-to-end ownership

      Production support & timely delivery

      Self-driven, flexible and innovative

      Microsoft Certified : Azure Data Engineer Associate (DP-203 / DP -300)

      Knowledge of DevOps and CI / CD pipelines in Azure

      Education :

      BSc / BA in Computer Science, Engineering or a related field

      Work Location : Vadodara, Gujarat, India

    Create a job alert for this search

    Principal Data Engineer • Vadodara, Gujarat, India

    Related jobs
    Data Engineer

    Data Engineer

    Randstad Enterprise • Vadodara, IN
    Shift Timing : 2 : 00 Pm - 11 : 00 Pm.Experience : 2- 4 years relevant Experience only ( this is a Junior position with us ). GCP - 2 years minimum working Experience.Worked with global stakeholders.Ran...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    ShimentoX Technologies • Vadodara, IN
    Data Engineer (Strong with Building data connectors).Key Skills : Python, Data Connectors, Metadata, API Integration-Rest / GraphQL. Must have proven background in building data connectors.Experience s...Show more
    Last updated: 2 days ago • Promoted
    Data Engineer - Fully Remote (Global Data Platform & Analytics Projects)

    Data Engineer - Fully Remote (Global Data Platform & Analytics Projects)

    SkillsCapital • Vadodara, Gujarat, India
    Remote
    We are hiring multiple Data Engineers to join international data platform, analytics, and cloud engineering teams.These fully remote, long-term freelance roles are ideal for engineers who can build...Show more
    Last updated: 5 days ago • Promoted
    Azure Data Engineer

    Azure Data Engineer

    Paritas Recruitment • Vadodara, IN
    Azure Data Engineer - 6 Month+ Rolling Contract.Remote - (Sunday to Thursday working days).Paritas is working with a global IT Consultancy & leading Energy client who are seeking an experienced and...Show more
    Last updated: 2 days ago • Promoted
    Senior Data Engineer

    Senior Data Engineer

    Ironbook AI • Vadodara, IN
    The ideal candidate will have strong experience with cloud platforms, modern ETL / ELT tools, and deep technical skills in Python, SQL, and distributed data frameworks. Design, develop, and maintain s...Show more
    Last updated: 18 hours ago • Promoted • New!
    Senior Data Engineer

    Senior Data Engineer

    Donyati • Vadodara, IN
    We are seeking a highly skilled Senior Data Engineer to join our team in building a modern data platform on AWS.You will play a key role in transitioning from legacy systems to a scalable, cloud-na...Show more
    Last updated: 16 days ago • Promoted
    Lead Data Engineer

    Lead Data Engineer

    Ironbook AI • Vadodara, IN
    We are seeking an experienced and driven Lead Data Engineer to spearhead the.AI use cases across the organization.Minimum 7 years of experience in data engineering, with at.Strong hands-on experien...Show more
    Last updated: 18 hours ago • Promoted • New!
    Data Engineer - Palantir Foundry

    Data Engineer - Palantir Foundry

    NP Group • Vadodara, Gujarat, India
    Data Engineer - Palantir Foundry, Workshop, Pyspark & Typescript Fully Remote - Long Term (initially 6 months) full time contract c$12. We have an immediate requirement for an experienced Data Eng...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    TerraGiG • Vadodara, IN
    Lead the design, development, and implementation of data solutions using AWS and Snowflake.Collaborate with cross-functional teams to understand business requirements and translate them into techni...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    Tata Consultancy Services • Vadodara, IN
    TCS has been a great pioneer in feeding the fire of Techies like you.We are a global leader in the technology arena and there’s nothing that can stop us from growing together.Your role is of key im...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    IntraEdge • Vadodara, IN
    We are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our growing data team. You will be responsible for building scalable and reli...Show more
    Last updated: 30+ days ago • Promoted
    GCP Data Engineer

    GCP Data Engineer

    Adastra • Vadodara, IN
    We are looking for a proactive and solution-oriented GCP Data Engineer to join our team.This role requires hands-on experience in Google Cloud Platform (GCP), especially with BigQuery and Airflow, ...Show more
    Last updated: 17 days ago • Promoted
    Data Engineer

    Data Engineer

    EXL • Vadodara, IN
    The person will be part of the EDP Data Platform (EDP) team for a major Insurance client.He / She will work with different stakeholders for architecting & Building EDP application platform to suppor...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    Staffingine LLC • Vadodara, IN
    The Data Engineer will be responsible for designing, developing, and optimizing scalable data pipelines and cloud-based data solutions. This role requires strong Python programming skills, expertise...Show more
    Last updated: 7 days ago • Promoted
    Data Engineer

    Data Engineer

    BayOne Solutions • Vadodara, IN
    We are seeking a highly experienced Data Engineer to join our MarTech team and play a pivotal role in driving innovation within our microservices architecture, with a strong emphasis on data engine...Show more
    Last updated: 30+ days ago • Promoted
    Principal Data Engineer

    Principal Data Engineer

    CodeMyMobile • Vadodara, Gujarat, India
    Experience Required - 7 to 10 Years How to Apply : Are you a Data Engineer who cares about clean engineering, autonomy, and solving real data challenges? If this sounds like you, we’d love to conn...Show more
    Last updated: 30+ days ago • Promoted
    Data Engineer

    Data Engineer

    MyRemoteTeam Inc • Vadodara, IN
    MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent.We empower businesses by providing world-class software engineers, operations suppo...Show more
    Last updated: 18 hours ago • Promoted • New!
    Lead Data Engineer

    Lead Data Engineer

    Confidential • Vadodara, Gujarat, India
    Skillset Required • 7+ years of experience in software development, with a strong foundation in distributed systems, cloud-native architectures, and data platforms. Expertise in big data technologie...Show more
    Last updated: 12 hours ago • Promoted • New!