Talent.com
Technical Lead – Web Crawling Systems, Data Pipelines
Technical Lead – Web Crawling Systems, Data PipelinesAIMLEAP • Prayagraj(Allahabad), IN
No longer accepting applications
Technical Lead – Web Crawling Systems, Data Pipelines

Technical Lead – Web Crawling Systems, Data Pipelines

AIMLEAP • Prayagraj(Allahabad), IN
1 day ago
Job description

Experience : 7 to 12 Years

Location : Remote / Bangalore

Engagement : Full-time

Positions : 2

Qualification : B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry : IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period : Immediate

What We Are Looking For

  • Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.
  • Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.
  • Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.
  • Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.
  • Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).
  • Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows..

Responsibilities

  • Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.
  • Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.
  • Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.
  • Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.
  • Define and enforce data quality, validation, and security measures across all data flows and pipelines.
  • Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.
  • Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.
  • Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.
  • Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling
  • Qualifications

  • Bachelor's or master's degree in engineering, Computer Science, or related field.
  • 7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.
  • Strong expertise in Python, SQL, and modern data processing practices.
  • Experience working with Airflow, Celery, or similar workflow automation tools.
  • Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.
  • Hands-on experience with cloud data platforms (AWS / GCP / Azure).
  • Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).
  • Strong analytical, architectural, and leadership skills.
  • Create a job alert for this search

    Technical Lead • Prayagraj(Allahabad), IN

    Related jobs
    Technical Lead

    Technical Lead

    Mphasis • Prayagraj(Allahabad), IN
    Looking for Senior Ingenium Developer with 10+ years' experience and following skills.Experience in Mainframe O / S and Development using COBOL programming language & JCL. Experience in development an...Show more
    Last updated: 14 days ago • Promoted
    Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

    Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

    AIMLEAP • Prayagraj, Uttar Pradesh, India
    Data Engineering Manager – Web Crawling & Pipeline Architecture Experience : 7 to 12 Years Location : Remote / Bangalore Engagement : Full-time Positions : 2 Qualification : B.Tech / MCA / Computer...Show more
    Last updated: 3 hours ago • Promoted • New!
    AWS Tech Lead - Contract

    AWS Tech Lead - Contract

    Gravity Infosolutions, Inc. • Prayagraj(Allahabad), IN
    Participate in refining and scoping upcoming sprint work.Assist solution architects with technical design and breaking down complex tasks. Accountable for timely delivery of assigned tickets, meetin...Show more
    Last updated: 12 hours ago • Promoted • New!
    Data Center Site Lead- OCI

    Data Center Site Lead- OCI

    Oracle • AU
    As a Site Lead for Oracle Data Centers, you will be the technical liaison between the technology teams and the Data Center Environment and will be key in maintaining the Operational run aspects.You...Show more
    Last updated: 30+ days ago
    Project Technical Lead

    Project Technical Lead

    Brunel • AU
    The Project Lead must possess strong expertise in Agile methodologies and SCRUM practices, effectively applying these principles to drive project success. They will work collaboratively with cross-f...Show more
    Last updated: 28 days ago
    Semantic Modeler Lead

    Semantic Modeler Lead

    Accenture • AU
    Join our Data & AI practice as a Semantic Modeler Lead, where you will define and lead the development of the enterprise semantic layer that enables consistent meaning, interoperability, and in...Show more
    Last updated: 2 days ago
    E-commerce Technical Project Manager( Bigcommerce / Shopify)

    E-commerce Technical Project Manager( Bigcommerce / Shopify)

    Upbott Consulting, Inc • Prayagraj(Allahabad), IN
    E-commerce Technical Project Manager.BigCommerce or Shopify projects.Candidates must have led end-to-end e-commerce implementations specifically on. This role requires someone who understands the Bi...Show more
    Last updated: 6 days ago • Promoted
    QA Team Leader

    QA Team Leader

    Huxley • Prayagraj(Allahabad), IN
    QA Team Leader – E-Commerce Platform.Join a dynamic development team responsible for building and maintaining large-scale online commerce services. This role focuses on a popular cashback and reward...Show more
    Last updated: 9 days ago • Promoted
    Lead Engineer

    Lead Engineer

    Hyqoo • Prayagraj(Allahabad), IN
    Design, deploy, and manage AWS cloud infrastructure, including EC2 instances, S3 buckets, VPCs, RDS databases, and Lambda functions. Assist in the design, implementation, and maintenance of backup, ...Show more
    Last updated: 25 days ago • Promoted
    Data Center Site Lead- OCI

    Data Center Site Lead- OCI

    Cerner • AU
    As a Site Lead for Oracle Data Centers, you will be the technical liaison between the technology teams and the Data Center Environment and will be key in maintaining the Operational run aspects.You...Show more
    Last updated: 30+ days ago
    Full-Stack Lead Developer High End Wellness Hospitality(Equity Only - Remote) Worldwide

    Full-Stack Lead Developer High End Wellness Hospitality(Equity Only - Remote) Worldwide

    Pranissa • Prayagraj(Allahabad), IN
    Remote
    Pranissa is a top-tier wellness and longevity platform connecting individuals with exceptional Wellness and longevity destinations, evidence-based wellness resorts, and age-defying experiences worl...Show more
    Last updated: 30+ days ago • Promoted
    Webflow Developer (Finsweet Client-First + CMS-Driven Build)

    Webflow Developer (Finsweet Client-First + CMS-Driven Build)

    RB Law • Prayagraj(Allahabad), IN
    We need a Webflow developer who can.Webflow using best practices, correct naming conventions, and scalable CMS structures. The goal is to ensure the marketing team can easily maintain and expand the...Show more
    Last updated: 2 hours ago • Promoted • New!
    Tech Lead (Full Stack | React + Python)

    Tech Lead (Full Stack | React + Python)

    Aumne AI • Prayagraj(Allahabad), IN
    Aumne AI is building next-gen AI systems for customer experience.We’re looking for a frontend-strong Tech Lead who values clean design, simple architecture, and fast execution.Lead UI architecture ...Show more
    Last updated: 12 hours ago • Promoted • New!
    Tech Lead –.Net / Python & AI

    Tech Lead –.Net / Python & AI

    Skillvera • Prayagraj(Allahabad), IN
    Technical Skills & Stack Requirements : .API development, and service orchestration.AWS or Azure cloud architecture.Bedrock, Lambda, ECS / EKS, Step Functions, S3, and SageMaker and Azure equivalents.U...Show more
    Last updated: 2 hours ago • Promoted • New!
    •AI Technical Lead •

    •AI Technical Lead •

    Accenture • AU
    Accenture is a global professional services company with leading capabilities in digital, cloud and security.Find out more about us at accenture. We are seeking an innovative and results-driven AI E...Show more
    Last updated: 20 days ago
    Site Reliability Engineer

    Site Reliability Engineer

    Capgemini • Prayagraj(Allahabad), IN
    Choosing Capgemini means choosing a company where you will be empowered to shape your career in the way you’d like, where you’ll be supported and inspired by a collaborative community of colleagues...Show more
    Last updated: 30+ days ago • Promoted
    Cybersecurity Lead(6 months contract)

    Cybersecurity Lead(6 months contract)

    Sekuro Asia • Prayagraj(Allahabad), IN
    Our client oversees and operates digital asset-related businesses.Our client aims to transform the financial industry by building a tech-enabled institutional grade ecosystem for issuance, distribu...Show more
    Last updated: 9 days ago • Promoted
    Technical Lead

    Technical Lead

    RapidBrains • Prayagraj(Allahabad), IN
    We are looking for an experienced Technical Lead who can architect scalable systems, mentor development teams, and guide complex projects from concept to deployment. You’ll partner closely with Prod...Show more
    Last updated: 3 days ago • Promoted