Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (7 to 12 yrs)
AIMLEAP • Panchkula, Haryana, India
1 day ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience: 7 to 12 Years

Location: Remote / Bangalore

Engagement: Full-time

Positions: 2

Qualification: B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry: IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period: Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows.

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling.

Qualifications

Bachelor's or master's degree in engineering, computer science, or a related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.

Strong expertise in Python, SQL, and modern data processing practices.

Experience working with Airflow, Celery, or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.
