Talent.com
Technical Lead – Web Crawling Systems, Data Pipelines
No longer accepting applications
AIMLEAP • Hyderabad, Telangana, India
1 day ago
Job description

Experience: 7 to 12 Years

Location: Remote / Bangalore

Engagement: Full-time

Positions: 2

Qualification: B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry: IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period: Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4AI into scraping, validation, and pipeline workflows.

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling.

Qualifications

Bachelor's or master's degree in Engineering, Computer Science, or a related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.

Strong expertise in Python, SQL, and modern data processing practices.

Experience working with Airflow, Celery, or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4AI, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.
