Exigo Tech delivers Infrastructure, Cloud, and Application solutions to businesses across Australia. We help companies achieve operational efficiency by empowering them with technology solutions that drive their business processes.
Exigo is looking for a full-time Sr. Data Engineer.
We are an ISO 27001:2022 certified organization.
Roles and Responsibilities
Install, configure, and manage Apache Spark (open-source) clusters on Ubuntu, including Spark master / worker nodes and Spark environment files.
Configure and manage Spark UI and Spark History Server for monitoring jobs, analyzing DAGs, stages, tasks, and troubleshooting performance.
Develop, optimize, and deploy PySpark ETL / ELT pipelines using DataFrame API, UDFs, window functions, caching, partitioning, and broadcasting.
Deploy PySpark jobs using spark-submit in client / cluster mode with proper logging and error handling.
Install, configure, and manage Apache Airflow including UI, scheduler, webserver, connections, and variables.
Create, schedule, and monitor Airflow DAGs for PySpark jobs using SparkSubmitOperator, BashOperator, or PythonOperator.
Configure and manage cron jobs for scheduling data processing tasks where needed.
Install and configure Trino (PrestoSQL) coordinator and worker nodes, and configure catalogs such as S3, MySQL, or PostgreSQL.
Maintain Linux / Ubuntu servers including services, logs, environment variables, memory usage, and port conflict resolution.
Design and implement scalable data architectures using Azure Data Services including ADF, Synapse, ADLS, Azure SQL, and Databricks.
Develop, manage, and automate ETL / ELT pipelines using Azure Data Factory (Pipelines, Mapping Data Flows).
Monitor, troubleshoot, and optimize data pipelines across Spark, Airflow, Trino, and Azure platforms.
Work with structured, semi-structured, and unstructured data across multiple data sources and formats.
Implement data analytics, transformation, backup, and recovery solutions.
Perform data migration, upgrade, and modernization using Azure and database tools.
Implement CI / CD pipelines for data solutions using Azure DevOps and Git.
Ensure data quality, governance, lineage, metadata management, and security compliance across cloud and big data environments.
Design and build data warehouses, Delta Lake, and Lakehouse systems.
Develop and rebuild reports / dashboards using Power BI, Tableau, or similar tools.
Collaborate with internal teams, clients, and business users to gather requirements and deliver high-quality data solutions.
Provide documentation, runbooks, and operational guidance.
Technical Skills :
1. Apache Spark (Open Source) & PySpark - Must
Apache Spark installation & cluster configuration (Ubuntu / Linux)
Spark master / worker setup (standalone & cluster mode)
Spark UI & History Server configuration and debugging
PySpark development (ETL pipelines, UDFs, window functions, DataFrame API)
Performance tuning (partitioning, caching, shuffles)
spark-submit deployment with monitoring and logging
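By way of illustration of the PySpark work above, the following is a minimal sketch of an ETL job that deduplicates records with a window function and writes partitioned output; the paths, column names, and Spark master URL are placeholders, not details of our environment.

    # Minimal PySpark ETL sketch: read, deduplicate with a window function,
    # and write partitioned output. Paths and column names are hypothetical.
    from pyspark.sql import SparkSession, functions as F, Window

    spark = SparkSession.builder.appName("orders_etl").getOrCreate()

    orders = spark.read.parquet("/data/raw/orders")  # assumed input path

    # Keep only the latest record per order_id using a window function.
    w = Window.partitionBy("order_id").orderBy(F.col("updated_at").desc())
    latest = (orders
              .withColumn("rn", F.row_number().over(w))
              .filter(F.col("rn") == 1)
              .drop("rn"))

    # Repartition by date and write partitioned Parquet for downstream queries.
    (latest
     .repartition("order_date")
     .write.mode("overwrite")
     .partitionBy("order_date")
     .parquet("/data/curated/orders"))

    spark.stop()

    # Example deployment in client mode against a standalone master, with the
    # event log enabled so the History Server can pick up the run:
    #   spark-submit --master spark://spark-master:7077 --deploy-mode client \
    #       --conf spark.eventLog.enabled=true orders_etl.py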
2. Apache Airflow & Job Orchestration - Must
Airflow installation & configuration (UI, scheduler, webserver)
Creating and scheduling DAGs (SparkSubmitOperator, BashOperator, PythonOperator)
Retry logic, triggers, alerting, and log management
Cron job scheduling & process automation
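As a sketch of the orchestration side (assuming Airflow 2.x with the apache-airflow-providers-apache-spark package installed), a DAG wrapping the job above with retry logic could look roughly like this; the connection ID, schedule, and file path are assumptions.

    # Minimal Airflow DAG sketch: schedules a nightly spark-submit with retries.
    # DAG id, paths, and the Spark connection are hypothetical placeholders.
    from datetime import datetime, timedelta

    from airflow import DAG
    from airflow.providers.apache.spark.operators.spark_submit import SparkSubmitOperator

    default_args = {
        "owner": "data-engineering",
        "retries": 2,                          # retry logic for transient failures
        "retry_delay": timedelta(minutes=10),
    }

    with DAG(
        dag_id="orders_etl_daily",
        start_date=datetime(2024, 1, 1),
        schedule_interval="@daily",
        catchup=False,
        default_args=default_args,
    ) as dag:

        run_orders_etl = SparkSubmitOperator(
            task_id="run_orders_etl",
            application="/opt/jobs/orders_etl.py",   # assumed path to the PySpark script
            conn_id="spark_default",                 # Airflow connection pointing at the Spark master
            conf={"spark.eventLog.enabled": "true"},
            verbose=True,
        )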
3. Trino (PrestoSQL) - Must
Trino coordinator & worker node setup
Catalog configuration (S3, RDBMS sources)
Distributed SQL troubleshooting & performance optimization
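For context, a Trino catalog is a small properties file placed under etc/catalog/ on the coordinator and worker nodes; the sketch below shows a hypothetical PostgreSQL catalog with placeholder connection details.

    # etc/catalog/postgresql.properties — example catalog; values are placeholders
    connector.name=postgresql
    connection-url=jdbc:postgresql://pg-host:5432/analytics
    connection-user=trino_reader
    connection-password=********

After the coordinator and workers are restarted, tables become queryable as postgresql.<schema>.<table>.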
4. Azure Data Services (Nice to have)
Azure Data Factory
Azure Synapse Analytics
Azure SQL / Cosmos DB
Azure Data Lake Storage (Gen2)
Azure Databricks (Delta, Notebooks, Jobs)
Azure Event Hubs / Stream Analytics
5. Microsoft Fabric (Nice to have)
Lakehouse
Warehouse
Dataflows
Notebooks
Pipelines
6. Programming & Querying
Python
PySpark
SQL
Scala
7. Data Modeling & Warehousing
Star schema modeling
Snowflake schema modeling
Fact / dimension modeling
Data warehouse & Lakehouse design
Delta Lake / Lakehouse architectures
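To make the fact / dimension and Delta Lake points concrete, here is a hedged PySpark sketch that joins staged orders to a customer dimension and appends to a Delta fact table; it assumes delta-spark is available on the cluster, and all names and paths are invented.

    # Hedged sketch of a star-schema load: surrogate-key lookup against a dimension,
    # then an append-only fact table in Delta format. Names and paths are hypothetical.
    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("load_fact_orders").getOrCreate()

    dim_customer = spark.read.format("delta").load("/lakehouse/dim_customer")
    stg_orders = spark.read.parquet("/data/curated/orders")

    fact_orders = (stg_orders
        .join(dim_customer.select("customer_id", "customer_sk"), "customer_id", "left")
        .select(
            "order_id",
            "customer_sk",                           # surrogate key from the dimension
            F.col("order_date").cast("date"),
            F.col("amount").cast("decimal(18,2)"),
        ))

    (fact_orders.write
        .format("delta")
        .mode("append")
        .partitionBy("order_date")
        .save("/lakehouse/fact_orders"))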
8. DevOps & CI / CD
Git / GitHub / Azure Repos
Azure DevOps pipelines (CI / CD)
Automated deployment for Spark, Airflow, ADF, Databricks, Fabric
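One possible shape for such a pipeline, shown only as a sketch with placeholder job names and paths, is a minimal azure-pipelines.yml that tests and packages the Spark jobs and Airflow DAGs on every push to main.

    # azure-pipelines.yml — minimal sketch; job names, paths, and commands are placeholders
    trigger:
      branches:
        include:
          - main

    pool:
      vmImage: 'ubuntu-latest'

    steps:
      - task: UsePythonVersion@0
        inputs:
          versionSpec: '3.10'

      - script: |
          pip install -r requirements.txt
          pytest tests/
        displayName: 'Install dependencies and run unit tests'

      - script: |
          zip -r dist/jobs.zip jobs/ dags/
        displayName: 'Package Spark jobs and Airflow DAGs'

      - publish: dist/jobs.zip
        artifact: data-jobs
        displayName: 'Publish build artifact'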
9. BI Tools (Nice to have)
Power BI
Tableau
Report building, datasets, DAX
10. Linux / Ubuntu Server Knowledge
Shell scripting
Service management
Logs & environment variables
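Typical commands for this kind of server maintenance (the service unit names are assumptions, the standard Ubuntu tooling is not) include:

    # Check a service and its recent logs
    systemctl status airflow-scheduler
    journalctl -u airflow-scheduler --since "1 hour ago"

    # Find what is holding a port (e.g. a Spark UI / Airflow webserver port conflict)
    sudo ss -tulpn | grep ':8080'

    # Inspect memory pressure and disk usage on the node
    free -h
    df -h /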
Soft Skills :
Excellent problem solving and communication skills
Able to work well in a team setting
Excellent organizational and time management skills
Taking end-to-end ownership
Production support & timely delivery
Self-driven, flexible and innovative
Microsoft Certified : Azure Data Engineer Associate (DP-203 / DP-300)
Knowledge of DevOps and CI / CD pipelines in Azure
Education :
BSc / BA in Computer Science, Engineering or a related field
Work Location : Vadodara, Gujarat, India