Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)AIMLEAP • mohali, India
Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

Data Engineering Manager – Web Crawling & Pipeline Architecture ( 7 to 2 yrs)

AIMLEAP • mohali, India
3 days ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience : 7 to 12 Years

Location : Remote / Bangalore

Engagement : Full-time

Positions : 2

Qualification : B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry : IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period : Immediate

What We Are Looking For

  • Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.
  • Expertise in designing, building, and optimizing scalable data pipelines , preferably using workflow orchestration tools such as Airflow or Celery .
  • Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.
  • Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.
  • Deep understanding of web crawling frameworks , proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).
  • Strong expertise in AI-driven automation , including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows..

Responsibilities

  • Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.
  • Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.
  • Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.
  • Establish pipeline orchestration using Airflow, Celery , or similar distributed processing technologies.
  • Define and enforce data quality, validation, and security measures across all data flows and pipelines.
  • Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.
  • Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.
  • Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.
  • Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling
  • Qualifications

  • Bachelor's or master's degree in engineering, Computer Science, or related field.
  • 7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems .
  • Strong expertise in Python, SQL , and modern data processing practices.
  • Experience working with Airflow, Celery , or similar workflow automation tools.
  • Solid understanding of proxy systems, anti-bot techniques , and scalable crawler architecture.
  • Hands-on experience with cloud data platforms (AWS / GCP / Azure).
  • Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).
  • Strong analytical, architectural, and leadership skills.
  • Create a job alert for this search

    Engineering Manager • mohali, India

    Related jobs
    Technical Lead – Web Crawling Systems, Data Pipelines

    Technical Lead – Web Crawling Systems, Data Pipelines

    AIMLEAP • Mohali, Punjab, India
    Experience : 7 to 12 Years Location : Remote / Bangalore Engagement : Full-time Positions : 2 Qualification : B.Tech / MCA / Computer Science / IT Industry : IT / Data / AI / E-commerce / FinTech / Healt...Show more
    Last updated: 6 hours ago • Promoted • New!
    Senior Solutions Architect & Engineering Manager

    Senior Solutions Architect & Engineering Manager

    Lytegen • Sahibzada Ajit Singh Nagar, PB, IN
    Quick Apply
    We are hiring a Senior Solutions Architect & Engineering Manager to design, build, and scale the full technology foundation for our rapidly growing U. This is a critical senior technical leaders...Show more
    Last updated: 10 days ago
    Lead Data Engineer

    Lead Data Engineer

    Guidanz Inc • Mohali, Punjab, India
    About BI Connector BI Connector is the industry leading solution for integrating Oracle Fusion Cloud data into modern BI platforms like Power BI, Tableau, and Data Warehouse, without complex ETL.Ou...Show more
    Last updated: 6 hours ago • Promoted • New!
    Business Development Manager – Shopify Development & AI Commerce

    Business Development Manager – Shopify Development & AI Commerce

    Mandasa Technologies | Shopify Plus Partner • mohali, India
    E-commerce, Shopify Development, AI Automation.We are a Shopify development agency building high-performance storefronts, custom apps, CRO solutions, and AI-driven automations.We help DTC brands sc...Show more
    Last updated: 22 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    DataCouch • Chandigarh, India, India
    Job Title : Data, AI & Data Engineering Instructor.Experience Required : 2–3 years.We are seeking a highly skilled.Data, AI, and Data Engineering Instructor. The ideal candidate will have strong hands...Show more
    Last updated: 20 days ago • Promoted
    AI Automation Engineer (Internship)

    AI Automation Engineer (Internship)

    Bhalekar Consulting • Chandigarh, Chandigarh, India
    Position Title : AI Automation Engineer (Internship).Learning and Training-Based Internship (Unpaid as per company policy). Assigned Department Supervisor / Mentor.Australia-based consulting firm spe...Show more
    Last updated: 29 days ago • Promoted
    Ecommerce Manager

    Ecommerce Manager

    Mother Sparsh • Chandigarh, Chandigarh, India
    Innovation Rooted in Indian Traditions!.Inspired by the love and challenges that new parents encounter, Mother Sparsh was founded in 2018 by dedicated parents. Who understand the challenges of paren...Show more
    Last updated: 28 days ago • Promoted
    Search Engine Optimization Executive

    Search Engine Optimization Executive

    SEO Discovery Private Limited • Sahibzada Ajit Singh Nagar, Punjab, India
    SEO Discovery is a global leader in delivering next-generation digital marketing services, solutions, and consulting.With over a decade of experience, we have earned a reputation for driving measur...Show more
    Last updated: 15 hours ago • Promoted • New!
    Manager, Software Development Engineering

    Manager, Software Development Engineering

    Zscaler • Mohali, India
    Zscaler accelerates digital transformation so our customers can be more agile efficient resilient and secure.Our cloud native Zero Trust Exchange platform protects thousands of customers from cyber...Show more
    Last updated: 7 days ago • Promoted
    Senior Manager, Full Stack Developer

    Senior Manager, Full Stack Developer

    Zscaler • Mohali, India
    Serving thousands of enterprise customers around the world including 45% of Fortune 500 companies Zscaler (NASDAQ : ZS) was founded in 2007 with a mission to make the cloud a safe place to do busine...Show more
    Last updated: 13 days ago • Promoted
    Transition Manager

    Transition Manager

    AIONOS • Mohali district, India, India
    This role is specific to the travel domain.Artificial Intelligence on Operating Systems.AIonOS is pioneering the shift towards building AI-Native enterprises. AI into core business functions, ensuri...Show more
    Last updated: 23 days ago • Promoted
    Data Engineer

    Data Engineer

    TerraGiG • mohali, India
    Lead the design, development, and implementation of data solutions using AWS and Snowflake.Collaborate with cross-functional teams to understand business requirements and translate them into techni...Show more
    Last updated: 2 days ago • Promoted
    Machine Learning Engineer

    Machine Learning Engineer

    Trantor • Chandigarh, India, India
    Development and integration of Python-based applications with LLMs (OpenAI, DeepSeek, Anthropic, LLaMA, etc.Architect and implement LLM pipelines including prompt engineering, retrieval-augmented g...Show more
    Last updated: 30+ days ago • Promoted
    Marketing Automation Manager

    Marketing Automation Manager

    Bhalekar Consulting • Chandigarh, Chandigarh, India
    Position Title : Marketing Automation Manager.Chandigarh, India (On-site / Hybrid).Business Development & Marketing.Australia-based consulting firm specializing in . Digital Transformation, Data Engi...Show more
    Last updated: 29 days ago • Promoted
    Engineering Manager

    Engineering Manager

    Confidential • mohali, India
    The ideal candidate will be responsible for managing and inspiring his or her team to achieve their performance metrics.Your role will involve strategizing, project management, part staff managemen...Show more
    Last updated: 1 day ago • Promoted
    Amazon Redshift

    Amazon Redshift

    Vidhema Technologies • mohali, India
    Notice Period : Immediate Joiners Preferred.We are looking for an experienced.Amazon Redshift Developer to lead the design, setup, and management of new Redshift projects from the ground up.The idea...Show more
    Last updated: 16 hours ago • Promoted • New!
    AI Data Engineer

    AI Data Engineer

    Turing • Chandigarh, India, India
    We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
    Last updated: 15 hours ago • Promoted • New!
    Data Engineer

    Data Engineer

    Talink • Chandigarh, India, India
    We’re Hiring! Data Engineer – Securiti Platform Specialist.Chandigarh, India (preferred) or Remote (India).Talink is a rapidly growing global Technology Services company delivering cutting-edge sol...Show more
    Last updated: 16 days ago • Promoted