Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (7 to 12 yrs)
AIMLEAP • Tirunelveli, Tamil Nadu, India
22 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience: 7 to 12 Years

Location: Remote / Bangalore

Engagement: Full-time

Positions: 2

Qualification: B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry: IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period: Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines, preferably using workflow orchestration tools such as Airflow or Celery.

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks, proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation, including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows.

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery, or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling.

Qualifications

Bachelor's or master's degree in Engineering, Computer Science, or a related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems.

Strong expertise in Python, SQL, and modern data processing practices.

Experience working with Airflow, Celery, or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques, and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.
