Talent.com
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)AIMLEAP • Belgaum, Karnataka, India
Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

Data Engineering Manager – Web Crawling & Pipeline Architecture (2 to 7yrs)

AIMLEAP • Belgaum, Karnataka, India
16 hours ago
Job description

Data Engineering Manager – Web Crawling & Pipeline Architecture

Experience : 7 to 12 Years

Location : Remote / Bangalore

Engagement : Full-time

Positions : 2

Qualification : B.E / B.Tech / M.Tech / MCA / Computer Science / IT

Industry : IT / Data / AI / E-commerce / FinTech / Healthcare

Notice Period : Immediate

What We Are Looking For

Proven experience leading data engineering teams with strong ownership of web crawling systems and pipeline architecture.

Expertise in designing, building, and optimizing scalable data pipelines , preferably using workflow orchestration tools such as Airflow or Celery .

Hands-on proficiency in Python and SQL for data extraction, transformation, processing, and storage.

Experience working with cloud platforms such as AWS, GCP, or Azure for data infrastructure, deployments, and pipeline operations.

Deep understanding of web crawling frameworks , proxy rotation, anti-bot strategies, session handling, and compliance with global data collection standards (GDPR / CCPA-safe crawling).

Strong expertise in AI-driven automation , including integrating AI agents or frameworks like Crawl4ai into scraping, validation, and pipeline workflows..

Responsibilities

Lead and mentor data engineering and web crawling teams, ensuring high-quality delivery and adherence to best practices.

Architect, implement, and optimize scalable data pipelines that support high-volume data ingestion, transformation, and storage.

Build and maintain robust crawling systems using modern frameworks, handling IP rotation, throttling, and dynamic content extraction.

Establish pipeline orchestration using Airflow, Celery , or similar distributed processing technologies.

Define and enforce data quality, validation, and security measures across all data flows and pipelines.

Collaborate with product, engineering, and analytics teams to translate data requirements into scalable technical solutions.

Develop monitoring, logging, and performance metrics to ensure high availability and reliability of data systems.

Oversee cloud-based deployments, cost optimization, and infrastructure improvements on AWS / GCP / Azure.

Integrate AI agents or LLM-based automation for tasks such as error resolution, data validation, enrichment, and adaptive crawling

Qualifications

Bachelor's or master's degree in engineering, Computer Science, or related field.

7–12 years of relevant experience in data engineering, pipeline design, or large-scale web crawling systems .

Strong expertise in Python, SQL , and modern data processing practices.

Experience working with Airflow, Celery , or similar workflow automation tools.

Solid understanding of proxy systems, anti-bot techniques , and scalable crawler architecture.

Hands-on experience with cloud data platforms (AWS / GCP / Azure).

Experience with AI / LLM frameworks (Crawl4ai, LangChain, LlamaIndex, AutoGen, OpenAI, or similar).

Strong analytical, architectural, and leadership skills.

Create a job alert for this search

Engineering Manager • Belgaum, Karnataka, India

Related jobs
Data Engineer

Data Engineer

IntraEdge • Belgaum, Karnataka, India
Job Title : Data Engineer Experience : 5–9 years Location : [Remote] Job Summary : We are looking for a skilled Data Engineer with strong expertise in Python, PySpark, AWS services (Glue, Lambda)...Show more
Last updated: 30+ days ago • Promoted
Sr Manager Analytics

Sr Manager Analytics

Live Connections • Belgaum, Karnataka, India
Role - Sr Manager Analytics Experience - 8+ years Work Location - Remote Required Notice Period - Immediate Joiners or Serving Notice Period Must Have Skills 8+ years of working experience in ML &...Show more
Last updated: 30+ days ago • Promoted
Senior Data Engineer - Data Acquisition

Senior Data Engineer - Data Acquisition

InfoBeans • Belgaum, Karnataka, India
About the Job We are seeking a highly skilled Senior Data Engineer – Data Acquisition (ODS) to join our data engineering team. The ideal candidate will have extensive hands-on experience in buildi...Show more
Last updated: 23 days ago • Promoted
Lead Data Engineer

Lead Data Engineer

Guidanz Inc • Belgaum, Karnataka, India
About BI Connector BI Connector is the industry leading solution for integrating Oracle Fusion Cloud data into modern BI platforms like Power BI, Tableau, and Data Warehouse, without complex ETL.Ou...Show more
Last updated: 1 day ago • Promoted
Project Manager – Data Engineering & Analytics

Project Manager – Data Engineering & Analytics

Brillio • Belgaum, Karnataka, India
About the Company : We are looking for a skilled Technical Project Manager to lead and deliver projects in data engineering and analytics. You will manage cross-functional teams to execute data plat...Show more
Last updated: 30+ days ago • Promoted
Engineering Manager

Engineering Manager

Cargoz.com • Belgaum, Karnataka, India
We are seeking an experienced Engineering Manager to lead and scale a high-performing engineering team within our fast-growing technology startup. This role is perfect for leaders who excel in dyn...Show more
Last updated: 16 hours ago • Promoted • New!
Data Engineer

Data Engineer

MyRemoteTeam Inc • Belgaum, Karnataka, India
About Us MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent. We empower businesses by providing world-class software engineers, operati...Show more
Last updated: 8 hours ago • Promoted • New!
Lead Data Engineer

Lead Data Engineer

Confidential • Belgaum, Karnataka, India
Skillset Required • 7+ years of experience in software development, with a strong foundation in distributed systems, cloud-native architectures, and data platforms. Expertise in big data technologie...Show more
Last updated: 8 hours ago • Promoted • New!
Performance Marketing Manager

Performance Marketing Manager

Scratchpad.inc • Belgaum, Karnataka, India
This role is all about driving measurable growth through data-led decisions, sharp execution, and close collaboration with creative and tech teams. You’ll own performance outcomes end-to-end from st...Show more
Last updated: 9 days ago • Promoted
Senior Data Engineer

Senior Data Engineer

Ironbook AI • Belgaum, Karnataka, India
Role Summary We are looking for a Senior Data Engineer to design, build, and optimize high-performance data pipelines and data systems. The ideal candidate will have strong experience with cloud p...Show more
Last updated: 8 hours ago • Promoted • New!
Senior Data Engineer

Senior Data Engineer

Donyati • Belgaum, Karnataka, India
Senior Data Engineer Job Title : Senior Data Engineer Location : Remote / Willing to Travel Job Type : Full-time Experience Level : 8+ years About the Role : We are seeking a highly skilled Senior D...Show more
Last updated: 16 days ago • Promoted
Lead Data Engineer

Lead Data Engineer

Ironbook AI • Belgaum, Karnataka, India
We are seeking an experienced and driven Lead Data Engineer to spearhead the design and development of a modern, cloud-native data warehouse on AWS. This role is critical to building a scalable, sec...Show more
Last updated: 8 hours ago • Promoted • New!
Data Engineer (Snowflake + Databricks)

Data Engineer (Snowflake + Databricks)

MyRemoteTeam Inc • Belgaum, Karnataka, India
About Us MyRemoteTeam, Inc is a fast-growing distributed workforce enabler, helping companies scale with top global talent. We empower businesses by providing world-class software engineers, operati...Show more
Last updated: 8 hours ago • Promoted • New!
Data Architect (12+years)

Data Architect (12+years)

MindBrain • Belgaum, Karnataka, India
Key Responsibilities Define and execute the enterprise data architecture strategy in alignment with business goals and future growth needs. Architect, design, and implement Data Lakes, Data Warehou...Show more
Last updated: 8 hours ago • Promoted • New!
AWS Data Architect

AWS Data Architect

ACL Digital • Belgaum, Karnataka, India
AWS (S3, Redshift, Glue, Lake Formation, IAM).Proficient in data modeling, performance tuning, and security best practices. AWS Certified Solutions Architect preferred.Show more
Last updated: 21 days ago • Promoted
Digital Marketer & Lead Generation Executive

Digital Marketer & Lead Generation Executive

Enerzi • Belgaum, Karnataka, India
India’s leading industrial microwave systems and clean-hydrogen innovation company, is expanding its Growth & Marketing team. Digital Marketer & Lead Generation Executive.B2B leads, and amplify Ener...Show more
Last updated: 15 hours ago • Promoted • New!
Engineering Director - Python / AI / ML

Engineering Director - Python / AI / ML

Aarav Solutions • Belgaum, Karnataka, India
We are NOT working with any agencies for this Role.Candidates are encouraged to only apply through LinkedIn.Company Description Aarav Solutions is a global leader in product engineering and IT c...Show more
Last updated: 8 hours ago • Promoted • New!
Data Engineer - Web Scraping

Data Engineer - Web Scraping

Alternative Path • Belgaum, Karnataka, India
Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm.In this role, you will collaborate with individuals across various company de...Show more
Last updated: 30+ days ago • Promoted