Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)AIMLEAP • Trivandrum, Kerala, India
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

AIMLEAP • Trivandrum, Kerala, India
15 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience : 7 to 12 Years

Location : Remote / Bangalore

Engagement : Full-time

Positions : 2

Qualification : B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry : IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period : Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines , preferably using workflow orchestration tools such as Airflow or Celery .

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks , proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation , including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows..

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery , or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling

Qualifications

Bachelor's or master's degree in engineering, Computer Science, or related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems .

Strong expertise in Python, SQL , and modern data processing practices.

Experience working with Airflow, Celery , or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques , and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.

Create a job alert for this search

Engineering Manager • Trivandrum, Kerala, India

Related jobs
Senior Data Architect- Snowflake

Senior Data Architect- Snowflake

USEReady • Trivandrum, Kerala, India
USEReady is a data and analytics firm that provides the strategies, tools, capability, and capacity that businesses need to turn their data into a competitive advantage. USEReady partners with cloud...Show more
Last updated: 14 days ago • Promoted
Customer Success Manager

Customer Success Manager

CareStack - Dental Practice Management • Trivandrum, Kerala, India
Responsible for customer health, user satisfaction, overall growth and churn prevention.Continuous monitoring of feature adoption across customers ensuring higher degree of.Drive product initiative...Show more
Last updated: 23 hours ago • Promoted
Engineering Manager

Engineering Manager

Confidential • Trivandrum, Kerala, India
The ideal candidate will be responsible for managing and inspiring his or her team to achieve their performance metrics.Your role will involve strategizing, project management, part staff managemen...Show more
Last updated: 3 days ago • Promoted
Senior AI Engineer

Senior AI Engineer

IBS Software • Trivandrum, Kerala, India
Trivandrum / Kochi / Chennai / Bangalore.We are seeking a talented AI Engineer to join our Data & AI Center of Excellence. You will join to build AI-first capabilities across IBS products and customer sol...Show more
Last updated: 9 days ago • Promoted
UI / UX Designer

UI / UX Designer

Cocopalms • Trivandrum, Kerala, India
We are looking for a creative and detail-oriented.The ideal candidate will have a strong sense of design aesthetics, usability principles, and experience in creating engaging digital experiences an...Show more
Last updated: 4 days ago • Promoted
Senior Data Engineer - Data Acquisition

Senior Data Engineer - Data Acquisition

InfoBeans • Trivandrum, Kerala, India
About the Job We are seeking a highly skilled Senior Data Engineer – Data Acquisition (ODS) to join our data engineering team. The ideal candidate will have extensive hands-on experience in buildi...Show more
Last updated: 23 days ago • Promoted
Data Engineer (Snowflake + Databricks)

Data Engineer (Snowflake + Databricks)

MyRemoteTeam Inc • Trivandrum, Kerala, India
About Us MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent. We empower businesses by providing world-class software engineers, operati...Show more
Last updated: 7 hours ago • Promoted • New!
Project Manager – Data Engineering & Analytics

Project Manager – Data Engineering & Analytics

Brillio • Trivandrum, Kerala, India
About the Company : We are looking for a skilled Technical Project Manager to lead and deliver projects in data engineering and analytics. You will manage cross-functional teams to execute data plat...Show more
Last updated: 30+ days ago • Promoted
Senior AI Engineer

Senior AI Engineer

Lexoga • Trivandrum, Kerala, India
We’re hiring for a market-leading edge computing startup that’s building AI infrastructure for remote and low-connectivity environments. Their mission is to power real-time, on-premise intelligence ...Show more
Last updated: 30+ days ago • Promoted
Junior Web Designer

Junior Web Designer

Green Analytic Solutions (GCS) • Trivandrum, Kerala, India
Green Analytic Solutions is built on a foundation of business sustainability consultancy.As we continue our growth journey, we are broadening our expertise into. To strengthen this expansion, we are...Show more
Last updated: 13 hours ago • Promoted • New!
Sr Manager Analytics

Sr Manager Analytics

Live Connections • Trivandrum, Kerala, India
Role - Sr Manager Analytics Experience - 8+ years Work Location - Remote Required Notice Period - Immediate Joiners or Serving Notice Period Must Have Skills 8+ years of working experience in ML &...Show more
Last updated: 30+ days ago • Promoted
Lead -Applied AI Researcher

Lead -Applied AI Researcher

Flytxt • Trivandrum, Kerala, India
As Applied AI Researcher, you will define and execute a research-intensive AI and data science agenda.You will lead foundational model research, drive agentic AI innovation, and publish advances th...Show more
Last updated: 13 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Donyati • Trivandrum, Kerala, India
Senior Data Engineer Job Title : Senior Data Engineer Location : Remote / Willing to Travel Job Type : Full-time Experience Level : 8+ years About the Role : We are seeking a highly skilled Senior ...Show more
Last updated: 16 days ago • Promoted
Principal Data Engineer

Principal Data Engineer

CodeMyMobile • Trivandrum, Kerala, India
Experience Required - 7 to 10 Years How to Apply : Are you a Data Engineer who cares about clean engineering, autonomy, and solving real data challenges? If this sounds like you, we’d love to conn...Show more
Last updated: 30+ days ago • Promoted
Engineering Director - Python / AI / ML

Engineering Director - Python / AI / ML

Aarav Solutions • Trivandrum, Kerala, India
We are NOT working with any agencies for this Role.Candidates are encouraged to only apply through LinkedIn.Company Description Aarav Solutions is a global leader in product engineering and IT c...Show more
Last updated: 7 hours ago • Promoted • New!
Engineering Manager

Engineering Manager

Cargoz.com • Trivandrum, Kerala, India
We are seeking an experienced Engineering Manager to lead and scale a high-performing engineering team within our fast-growing technology startup. This role is perfect for leaders who excel in dyn...Show more
Last updated: 11 hours ago • Promoted • New!
AWS Data Architect

AWS Data Architect

ACL Digital • Trivandrum, Kerala, India
AWS (S3, Redshift, Glue, Lake Formation, IAM).Proficient in data modeling, performance tuning, and security best practices. AWS Certified Solutions Architect preferred.Show more
Last updated: 21 days ago • Promoted
Lead Data Engineer

Lead Data Engineer

Ironbook AI • Trivandrum, Kerala, India
We are seeking an experienced and driven Lead Data Engineer to spearhead the design and development of a modern, cloud-native data warehouse on AWS. This role is critical to building a scalable, sec...Show more
Last updated: 7 hours ago • Promoted • New!