Talent.com
Data Engineer - Web Scraping
Data Engineer - Web ScrapingAlternative Path • Sangli, India
No longer accepting applications
Data Engineer - Web Scraping

Data Engineer - Web Scraping

Alternative Path • Sangli, India
30+ days ago
Job description

Alternative Path is seeking skilled software developers to collaborate on client projects with an asset management firm. In this role, you will collaborate with individuals across various company departments to shape and innovate new products and features for our platform, enhancing existing ones. You will have a large degree of independence and trust, but you won't be isolated; the support of the Engineering team leads, the Product team leads, and every other technology team member is behind you. This is an opportunity to join a team-first meritocracy and help grow an entrepreneurial group inside Alternative Path. You will be asked to contribute, given ownership, and will be expected to make your voice heard.

Role Summary :

Performing Web Scraping using various scraping techniques and then utilizing Python’s Pandas library for data cleaning and manipulation. Then ingesting the data into a Database / Warehouse, and scheduling the scrapers using Airflow or other tools

Role Overview

The Web Scraping Team at Alternative Path is seeking a creative and detail-oriented developer to contribute to client projects. The team develops essential applications, datasets, and alerts for various teams within the client's organization, supporting their daily investment decisions. The mission is to maintain operational excellence by delivering high-quality proprietary datasets, timely notifications, and exceptional service. We are seeking someone who is self-motivated, self-sufficient, with a passion for tinkering and a love for automation.

In your role, you will :

➢ Collaborate with analysts to understand and anticipate requirements.

➢ Design, implement, and maintain Web scrapers for a wide variety of alternative datasets.

➢ Perform Data Cleaning, Exploration, Transformation etc. of scraped data.

➢ Collaborate with cross-functional teams to understand data requirements and implement efficient data processing workflows.

➢ Author QC checks to validate data availability and integrity.

➢ Maintain alerting systems and investigate time-sensitive data incidents to ensure smooth day-to-day operations.

➢ Design and implement products and tools to enhance the Web scraping Platform.

Qualifications

Must have

➢ Bachelor's / master’s degree in computer science or in any related field

➢ 3-5 years of software development experience

➢ Strong Python and SQL / Database skills

➢ Strong expertise in using the Pandas library (Python) is a must

➢ Experience with web technologies (HTML / JS, APIs, etc.)

➢ Proven work experience in working with large data sets for Data cleaning, Data transformation, Data manipulation, and Data replacements.

➢ Excellent verbal and written communication skills

➢ Aptitude for designing infrastructure, data products, and tools for Data Scientists

Preferred

➢ Familiarity with scraping and common scraping tools (Selenium, scrapy, Fiddler, Postman, xpath) ➢ Experience containerizing workloads with Docker (Kubernetes a plus)

➢ Experience with build automation (Jenkins, Gitlab CI / CD) ➢ Experience with AWS technologies like S3, RDS, SNS, SQS, Lambda, etc.

Create a job alert for this search

Data Engineer • Sangli, India

Related jobs
AWS Data Engineer (Remote)

AWS Data Engineer (Remote)

Mindcraft Labs • sangli, maharashtra, in
Remote
This role focuses on building and maintaining data pipelines and analytics infrastructure on AWS.You will work daily with S3, Glue, Redshift, Athena, Lake Formation, Airflow, SNS / SQS, and Postgres ...Show more
Last updated: less than 1 hour ago • Promoted • New!
Full Stack Engineer (4-6 YOE)

Full Stack Engineer (4-6 YOE)

Redica Systems • sangli, maharashtra, in
Redica Systems is a SaaS start-up serving more than 200 customers within the life science sector, with a specific focus on Pharmaceuticals and MedTech. Our workforce is distributed globally, with he...Show more
Last updated: 3 days ago • Promoted
Data Engineer

Data Engineer

System Soft Technologies • sangli, maharashtra, in
Location : Remote (3–4-hour time zone overlaps with EST if off shore).Experience with next flow is required, as the consultant will make targeted enhancements to existing workflows and pipelines.Whi...Show more
Last updated: 2 days ago • Promoted
Senior Snowflake Data Engineer

Senior Snowflake Data Engineer

Luxoft • sangli, maharashtra, in
We are seeking a highly skilled Snowflake Data Engineer with 7 years of IT experience to design, build, and optimize scalable data pipelines and cloud-based solutions across AWS, Azure, and GCP.The...Show more
Last updated: 2 days ago • Promoted
Full Stack Engineer

Full Stack Engineer

Insight Global • sangli, maharashtra, in
Duration : 6 month contract with potential to convert permanent.JS; primary codebase is frontend-heavy.Proficient with Git for source code management. Hands on experience with AWS Elastic Beans, EC2,...Show more
Last updated: 30+ days ago • Promoted
Senior Fullstack Engineer

Senior Fullstack Engineer

Black Dog Labs • sangli, maharashtra, in
Senior Fullstack Engineer (with Data Engineering Experience).Remote (collaboration across time zones), India or LATAM preferred. Proficient English communication.Full-Stack Engineering / Backend Eng...Show more
Last updated: 2 days ago • Promoted
Ai Data Engineer - 17852

Ai Data Engineer - 17852

Turing • Sāngli, Republic Of India, IN
We’re looking for experienced AI data engineers skilled in Python to collaborate with one of the world’s top Large Language Model (LLM) companies. Your work will directly help improve how AI models ...Show more
Last updated: 21 days ago • Promoted
Data Engineer

Data Engineer

Staffingine LLC • sangli, maharashtra, in
The Data Engineer will be responsible for designing, developing, and optimizing scalable data pipelines and cloud-based data solutions. This role requires strong Python programming skills, expertise...Show more
Last updated: less than 1 hour ago • Promoted • New!
Remote GenAI Engineer

Remote GenAI Engineer

EazyML • sangli, maharashtra, in
Remote
Founded by Bell Labs research veterans, and associated with breakthrough startups like Amelia, EazyML, specializes in Transparent Machine Learning. Early on EazyML founders saw the need for Transpa...Show more
Last updated: 30+ days ago • Promoted
GCP Big Data Engineer (Full-time at a Fortune 500 tech MNC )

GCP Big Data Engineer (Full-time at a Fortune 500 tech MNC )

HARP • sangli, maharashtra, in
We are looking for an experienced and motivated.The ideal candidate will have 5 years of relevant experience in data engineering, with a strong focus on. This role requires strong technical expertis...Show more
Last updated: 30+ days ago • Promoted
AWS AI / ML Engineer (Remote)

AWS AI / ML Engineer (Remote)

Mindcraft Labs • sangli, maharashtra, in
Remote
This is a hands-on engineering role focused on building and maintaining AI and ML services on AWS.You will help turn ideas and prototypes into robust, production-ready APIs and ML flows using Amazo...Show more
Last updated: less than 1 hour ago • Promoted • New!
Data Engineer(Intern)

Data Engineer(Intern)

Tech Phoenix • sangli, maharashtra, in
Tech Phoenix is a dynamic platform dedicated to all things technology, offering the latest news, trends, and insights from the tech industry. Whether you are a tech enthusiast or a professional, our...Show more
Last updated: 1 day ago • Promoted
Senior Snowflake + DBT Engineer (8+ Years)

Senior Snowflake + DBT Engineer (8+ Years)

MindBrain • sangli, maharashtra, in
Job Opportunity : Snowflake + DBT Engineer.We are seeking a highly skilled Snowflake + DBT Engineer to design, build, and optimize scalable cloud-based data platforms. The ideal candidate will have s...Show more
Last updated: less than 1 hour ago • Promoted • New!
Snowflake Data Engineer

Snowflake Data Engineer

Live Connections • sangli, maharashtra, in
Role - Snowflake Data Engineer.Required Notice Period - Immediate Joiner.To apply, connect with Abhishek via abhishek.Show more
Last updated: 13 days ago • Promoted
Python / JavaScript Developer – Web Scraping ( 4 to 7 Years)

Python / JavaScript Developer – Web Scraping ( 4 to 7 Years)

AIMLEAP • sangli, maharashtra, in
Python / JavaScript Developer – Web Scraping - 4 to 7 years.Bachelor's degree in Computer Science, Information Technology. Strong expertise in web scraping with hands-on experience in large-scale data...Show more
Last updated: 30+ days ago • Promoted
Technical Implementation Engineer

Technical Implementation Engineer

Y Meadows • sangli, maharashtra, in
Technical Implementation Engineer.Y Meadows is a United States-based company specializing in artificial intelligence and automation solutions. We provide a low-code automation platform that streamli...Show more
Last updated: 30+ days ago • Promoted
AI Engineer

AI Engineer

NyxaLabs • sangli, maharashtra, in
We're seeking an exceptional AI Engineer with deep expertise in TensorFlow model training to design and build next-generation AI systems. This role focuses on developing sophisticated machine learni...Show more
Last updated: 20 hours ago • Promoted • New!
Senior Data Engineer

Senior Data Engineer

Arenema • sangli, maharashtra, in
India (remote – Bangalore / Karnataka area preferred).Full-time contractor / employee.You will be a core member of the team building a data platform that maps economic, advertising and real-estate ac...Show more
Last updated: less than 1 hour ago • Promoted • New!