Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)
AIMLEAP • Borivali, Maharashtra, India
13 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience: 7 to 12 Years

Location: Remote / Bangalore

Engagement: Full-time

Positions: 2

Qualification: B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry: IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period: Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows.

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling.

Qualifications

Bachelor's or master's degree in Engineering, Computer Science, or a related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.

Strong expertise in Python, SQL, and modern data processing practices.

Experience working with Airflow, Celery, or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.
