Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)AIMLEAP • Baddi, Himachal Pradesh, India
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

AIMLEAP • Baddi, Himachal Pradesh, India
11 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience : 7 to 12 Years

Location : Remote / Bangalore

Engagement : Full-time

Positions : 2

Qualification : B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry : IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period : Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines , preferably using workflow orchestration tools such as Airflow or Celery .

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks , proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation , including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows..

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery , or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling

Qualifications

Bachelor's or master's degree in engineering, Computer Science, or related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems .

Strong expertise in Python, SQL , and modern data processing practices.

Experience working with Airflow, Celery , or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques , and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.

Create a job alert for this search

Engineering Manager • Baddi, Himachal Pradesh, India

Related jobs
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 To 2 Yrs)

Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 To 2 Yrs)

AIMLEAP • Baddi, Republic Of India, IN
Data Engineering Manager – Web Crawling & Pipeline Architecture.Tech / MCA / Computer Science / IT.IT / Data / AI / E-commerce / FinTech / Healthcare. Experience working with cloud platforms such as...Show more
Last updated: 11 hours ago • Promoted • New!
Data Engineer - Fully Remote (Global Data Platform & Analytics Projects)

Data Engineer - Fully Remote (Global Data Platform & Analytics Projects)

SkillsCapital • baddi, himachal pradesh, in
Remote
These fully remote, long-term freelance roles are ideal for engineers who can build scalable data pipelines, work with modern cloud-native data stacks, and support large-scale enterprise data initi...Show more
Last updated: 3 hours ago • Promoted • New!
Full Stack Engineer

Full Stack Engineer

Programmers.io • baddi, himachal pradesh, in
Job Title : Senior Full Stack Developer (Laravel + Vue).We are seeking highly skilled Senior Full Stack Developers with 7–10 years of experience in Laravel and modern frontend frameworks (Vue.The ca...Show more
Last updated: 15 days ago • Promoted
Senior Engineering Manager

Senior Engineering Manager

Immacule Lifesciences • Nālāgarh, Republic Of India, IN
Equipment Design reviews, Plant layout Reviews and verified, PFD & P&ID finalization, piping Design, piping Isometrics, Design philosophy Review, Project cost & Time Estimation.Planning and Schedul...Show more
Last updated: 25 days ago • Promoted
Project Engineering Director

Project Engineering Director

Immacule Lifesciences • Nālāgarh, Republic Of India, IN
Equipment Design reviews, Plant layout Reviews and verified, PFD & P&ID finalization, piping Design, piping Isometrics, Design philosophy Review, Project cost & Time Estimation.Planning and Schedul...Show more
Last updated: 25 days ago • Promoted
Manager Engineering & Projects

Manager Engineering & Projects

Immacule Lifesciences • Nalagarh, Himachal Pradesh, India
Equipment Design reviews, Plant layout Reviews and verified, PFD & P&ID finalization, piping Design, piping Isometrics, Design philosophy Review, Project cost & Time Estimation.Planning and Schedul...Show more
Last updated: 25 days ago • Promoted
Senior AI Engineer

Senior AI Engineer

Xtnsion.AI • baddi, himachal pradesh, in
AI is building the agentic CX layer for modern businesses — AI voice + chat agents that autonomously handle bookings, lead follow-up, support workflows, CRM actions, and more across phone, WhatsApp...Show more
Last updated: 15 hours ago • Promoted • New!
Business Develop Manager

Business Develop Manager

Grantify • baddi, himachal pradesh, in
Grantify is an innovative education platform that bridges students and universities through a transparent admissions and tuition-matching system. By aligning student budgets and academic goals with ...Show more
Last updated: 15 hours ago • Promoted • New!
Principal QA Engineer (Cypress)

Principal QA Engineer (Cypress)

CES • Baddi, Himachal Pradesh, India
We are seeking a Principal QA Engineer to join our agile development team and take ownership of delivering high-quality software through advanced testing strategies. This is a hands-on IC role w...Show more
Last updated: 30+ days ago • Promoted
Full-Stack Developer - 20414

Full-Stack Developer - 20414

Turing • Baddi, Himachal Pradesh, India
Role Overview : Turing is looking for experienced Full Stack Developers to build modern solutions that power AI products and evaluation workflows. React / Angular / Vue) to implement features, improve c...Show more
Last updated: 30+ days ago • Promoted
AI / ML Engineer

AI / ML Engineer

Cozzera • Baddi, Himachal Pradesh, India
Job Title : AI / ML Engineer Experience : 5+ Years Location : Remote We are looking for an experienced AI / ML Engineer with a strong background in machine learning and deep learning, especially in time...Show more
Last updated: 11 hours ago • Promoted • New!
Engineering Project Lead

Engineering Project Lead

Immacule Lifesciences • Nālāgarh, Republic Of India, IN
Equipment Design reviews, Plant layout Reviews and verified, PFD & P&ID finalization, piping Design, piping Isometrics, Design philosophy Review, Project cost & Time Estimation.Planning and Schedul...Show more
Last updated: 25 days ago • Promoted
Sr. Azure Data Architect & Presales Solution

Sr. Azure Data Architect & Presales Solution

Programmers.io • baddi, himachal pradesh, in
We offer a vibrant and collaborative work environment, cutting-edge tools and technologies, and ample opportunities for professional growth. Job Title : Azure Data Architect.Experience required : 15+ ...Show more
Last updated: 18 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Primesoft Inc • baddi, himachal pradesh, in
Primesoft Enterprise IT Services Pvt.As a Software Engineer II - Data, you will contribute to the design and development of data systems including pipelines, APIs, analytics, AI and machine learnin...Show more
Last updated: 30+ days ago • Promoted
Data Scientist

Data Scientist

Recro • baddi, himachal pradesh, in
We’re seeking a highly skilled, hands-on Data Scientist with 4–10 years of experience in applied AI / ML to join our fast-paced team. This role requires deep expertise in transformer architectures and...Show more
Last updated: 30+ days ago • Promoted
AI Analyst

AI Analyst

Aventis Solutions • baddi, himachal pradesh, in
Aventis Solutions is igniting the AI revolution : .They have just launched The AI Executive podcast, which can be found here : . Now, our tech partner is establishing a new AI Innovation Hub in Pune, In...Show more
Last updated: 30+ days ago • Promoted
Cloud Engineer - Full Remote (Global Cloud-Native Projects)

Cloud Engineer - Full Remote (Global Cloud-Native Projects)

SkillsCapital • baddi, himachal pradesh, in
Remote
These long-term, fully remote freelance roles are ideal for engineers with strong hands-on experience in AWS, Azure, or Google Cloud who want to build scalable, secure, high-performance cloud solut...Show more
Last updated: 3 hours ago • Promoted • New!
Director of Technical Engineering (configuration) - LifeScience Experience

Director of Technical Engineering (configuration) - LifeScience Experience

Qinecsa Solutions • Baddi, India
Job Description : We are seeking a Director / Manager of Technical Engineer to oversee the technical design, development and deployment of client solutions (configurations, migrations and integration...Show more
Last updated: less than 1 hour ago • Promoted • New!