Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (7 to 12 yrs)
AIMLEAP • Ludhiana, Punjab, IN
4 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience: 7 to 12 Years

Location: Remote / Bangalore

Engagement: Full-time

Positions: 2

Qualification: B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry: IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period: Immediate

What We Are Looking For

  • Proven experience leading data engineering teams, with strong ownership of web crawling systems and pipeline architecture.
  • Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.
  • Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.
  • Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.
  • Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).
  • Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows.

Responsibilities

  • Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.
  • Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.
  • Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.
  • Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.
  • Define and enforce data quality, validation, and security measures across all data flows and pipelines.
  • Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.
  • Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.
  • Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.
  • Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling.

Qualifications

  • Bachelor's or Master's degree in Engineering, Computer Science, or a related field.
  • 7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.
  • Strong expertise in Python, SQL, and modern data processing practices.
  • Experience working with Airflow, Celery, or similar workflow automation tools.
  • Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.
  • Hands-on experience with cloud data platforms (AWS / GCP / Azure).
  • Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).
  • Strong analytical, architectural, and leadership skills.