Lead Data Engineer

Fusemachines, Pune, IN
30+ days ago
Job description

About Fusemachines

Fusemachines is a leading AI strategy, talent, and education services provider.

Founded by Sameer Maskey Ph.D., Adjunct Associate Professor at Columbia University, Fusemachines has a core mission of democratizing AI.

With a presence in 4 countries (Nepal, United States, Canada, and Dominican Republic) and more than 450 employees, Fusemachines seeks to bring its global expertise in AI to transform companies around the world.

Location : Remote (Full-time)

About the role

This is a remote full-time position, responsible for designing, building, testing, optimizing and maintaining the infrastructure and code required for data integration, storage, processing, pipelines and analytics (BI, visualization and Advanced Analytics) from ingestion to consumption, implementing data flow controls, and ensuring high data quality and accessibility for analytics and business intelligence purposes.

This role requires a strong foundation in programming, and a keen understanding of how to integrate and manage data effectively across various storage systems and technologies.

We're looking for someone who can quickly ramp up, contribute right away and lead the work in Data & Analytics, helping with everything from backlog definition to architecture decisions, and technically lead the rest of the team with minimal oversight.

We are looking for a skilled Sr. Data Engineer / Technical Lead with a strong background in Python, SQL, PySpark, Redshift and AWS cloud-based large-scale data solutions, and a passion for data quality, performance and cost optimization.

The ideal candidate will develop in an Agile environment and will also have GCP experience, to contribute to the migration from AWS to GCP.

This role is perfect for an individual passionate about leading, leveraging data to drive insights, improve decision-making, and support the strategic goals of the organization through innovative data engineering solutions.

Qualification / Skill Set Requirement :

Must have a full-time Bachelor's degree in Computer Science, Information Systems, Engineering, or a related field.

5+ years of real-world data engineering development experience in AWS and GCP (certifications preferred).

Strong expertise in Python, SQL, PySpark and AWS in an Agile environment, with a proven track record of building and optimizing data pipelines, architectures, and datasets, and proven experience in data storage, modeling, management, lake, warehousing, processing / transformation, integration, cleansing, validation and analytics.

Senior person who can understand requirements and design end to end solutions with minimal oversight.

Strong programming skills in one or more languages such as Python or Scala, and proficiency in writing efficient and optimized code for data integration, storage, processing and manipulation.

Strong knowledge of SDLC tools and technologies, including project management software (Jira or similar), source code management (GitHub or similar), CI / CD system (GitHub Actions, AWS CodeBuild or similar) and binary repository manager (AWS CodeArtifact or similar).

Good understanding of Data Modeling and Database Design Principles, with the ability to design and implement efficient database schemas that meet the requirements of the data architecture to support data solutions.

Strong SQL skills and experience working with complex data sets, Enterprise Data Warehouse and writing advanced SQL queries.

Proficient with Relational Databases (RDS, MySQL, Postgres, or similar) and NoSQL Databases (Cassandra, MongoDB, Neo4j, etc.).

Skilled in Data Integration from different sources such as APIs, databases, flat files and event streaming.

Strong experience in implementing data pipelines and efficient ELT / ETL processes, batch and real-time, in AWS and using open source solutions, being able to develop custom integration solutions as needed, including Data Integration from different sources such as APIs (PoS integrations are a plus), ERP (Oracle and Allegra are a plus), databases, flat files, Apache Parquet and event streaming, together with cleansing, transformation and validation of the data.

Strong experience with scalable and distributed Data Technologies such as Spark / PySpark, DBT and Kafka, to be able to handle large volumes of data.

Experience with stream-processing systems (Storm, Spark-Streaming, etc.) is a plus.

Strong experience in designing and implementing Data Warehousing solutions in AWS with Redshift.

Demonstrated experience in designing and implementing efficient ELT / ETL processes that extract data from source systems, transform it (DBT), and load it into the data warehouse.

Strong experience in Orchestration using Apache Airflow.

Expert in Cloud Computing in AWS, including deep knowledge of a variety of AWS services like Lambda, Kinesis, S3, Lake Formation, EC2, EMR, ECS / ECR, IAM, CloudWatch, etc.

Good understanding of Data Quality and Governance, including implementation of data quality checks and monitoring processes to ensure that data is accurate, complete, and consistent.

Good understanding of BI solutions including Looker and LookML (Looker Modeling Language).

Strong knowledge and hands-on experience of DevOps principles, tools and technologies (GitHub and AWS DevOps) including continuous integration, continuous delivery (CI / CD), infrastructure as code (IaC – Terraform), configuration management, automated testing, performance tuning and cost management and optimization.

Good problem-solving skills : being able to troubleshoot data processing pipelines and identify performance bottlenecks and other issues.

Possesses strong leadership skills with a willingness to lead, create ideas, and be assertive.

Strong project management and organizational skills.

Excellent communication skills to collaborate with cross-functional teams, including business users, data architects, DevOps / DataOps / MLOps engineers, data analysts, data scientists, developers, and operations teams.

It is essential to convey complex technical concepts and insights effectively to non-technical stakeholders.

Ability to document processes, procedures, and deployment configurations.

Responsibilities :

Design, implement, deploy, test and maintain highly scalable and efficient data architectures, defining and maintaining standards and best practices for data management independently with minimal guidance.

Ensure the scalability, reliability, quality and performance of data systems.

Mentor and guide junior / mid-level data engineers.

Collaborate with Product, Engineering, Data Scientists and Analysts to understand data requirements and develop data solutions, including reusable components.

Evaluate and implement new technologies and tools to improve data integration, data processing and analysis.

Design architecture, observability and testing strategies, and build reliable infrastructure and data pipelines.

Take ownership of the storage layer and data management tasks, including schema design, indexing, and performance tuning.

Swiftly address and resolve complex data engineering issues and incidents, and resolve bottlenecks in SQL queries and database operations.

Conduct Discovery on existing Data Infrastructure and Proposed Architecture.

Evaluate and implement cutting-edge technologies and methodologies, and continue learning and expanding skills in data engineering and cloud platforms, to improve and modernize existing data systems.

Evaluate, design, and implement data governance solutions : cataloging, lineage, quality and data governance frameworks that are suitable for a modern analytics solution, considering industry-standard best practices and patterns.

Define and document data engineering architectures, processes and data flows.

Assess best practices and design schemas that match business needs for delivering a modern analytics solution (descriptive, diagnostic, predictive, prescriptive).

Be an active member of our Agile team, participating in all ceremonies and continuous improvement activities.

Equal Opportunity Employer : Race, Color, Religion, Sex, Sexual Orientation, Gender Identity, National Origin, Age, Genetic Information, Disability, Protected Veteran Status, or any other legally protected group status.
