Talent.com
Data Scientist - Python / PySpark

BOT Consulting, Jaipur
8 days ago
Job description

About the job

We are seeking a skilled Data Scientist with 2 to 5 years of experience, specializing in Machine Learning, PySpark, and Databricks, with a proven track record in long-range demand and sales forecasting. This role is crucial for the development and implementation of an automotive OEM's next-generation Intelligent Forecast Application.

The position will involve building, optimizing, and deploying large-scale machine learning models for complex, long-term forecasting challenges using distributed computing frameworks, specifically PySpark on the Databricks platform. The work will directly support strategic decision-making across the automotive value chain, including areas like long-term demand planning, production scheduling, and inventory optimization.

The ideal candidate will have hands-on experience developing and deploying ML models for forecasting, particularly long-range predictions, in a production environment using PySpark and Databricks.

This role requires strong technical skills in machine learning, big data processing, and time series forecasting, combined with the ability to work effectively within a technical team to deliver robust and scalable long-range forecasting solutions.

Roles & Responsibilities :

  • Machine Learning Model Development & Implementation for Long-Range Forecasting : Design, develop, and implement scalable and accurate machine learning models specifically for long-range demand and sales forecasting challenges.
  • Apply advanced time series analysis techniques and integrate them with machine learning models leveraging PySpark for data processing and model training on large datasets within the Databricks environment.
  • Implement probabilistic forecasting methods using PySpark to capture uncertainty in long-range predictions.
  • Develop robust solutions for hierarchical and grouped long-range forecasting on distributed data.
  • Data Processing and Feature Engineering with PySpark : Build and optimize large-scale data pipelines for ingesting, cleaning, transforming, and engineering features relevant to long-range forecasting from diverse, complex automotive datasets using PySpark on Databricks.
  • Deployment and MLOps on Databricks : Develop and implement robust code for model training, inference, and deployment of long-range forecasting models directly within the Databricks platform.
  • Apply MLOps principles compatible with Databricks workflows for model versioning, monitoring, retraining, and managing the lifecycle of long-range ML forecasting models in production.
  • Collaborate with Data Engineering and IT Operations to ensure seamless deployment and operational efficiency of the forecasting application on Databricks.
  • Performance Evaluation & Optimization : Evaluate long-range forecasting model performance using relevant metrics (e.g., MAE, RMSE, and MAPE, along with metrics suited to longer horizons), and optimize models and data processing pipelines for improved accuracy and efficiency within the PySpark / Databricks ecosystem.
  • Technical Collaboration : Work effectively as part of a technical team, collaborating with other data scientists, data engineers, and software developers to integrate ML long-range forecasting solutions into the broader forecasting application built on Databricks.
  • Communicate technical details and forecasting results effectively within the technical team.
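
For illustration only, the evaluation metrics named above (MAE, RMSE, MAPE) can be sketched in plain Python. This is a minimal sketch and not part of the role's actual codebase: the function names and the sample numbers are hypothetical, and a production version would likely compute the same quantities over PySpark DataFrames.

```python
import math


def mae(actual, forecast):
    """Mean absolute error: average size of the forecast miss."""
    return sum(abs(a - f) for a, f in zip(actual, forecast)) / len(actual)


def rmse(actual, forecast):
    """Root mean squared error: penalizes large misses more than MAE."""
    squared = sum((a - f) ** 2 for a, f in zip(actual, forecast))
    return math.sqrt(squared / len(actual))


def mape(actual, forecast):
    """Mean absolute percentage error, in percent.

    Assumes no actual value is zero; MAPE is undefined in that case.
    """
    ratios = sum(abs((a - f) / a) for a, f in zip(actual, forecast))
    return 100.0 * ratios / len(actual)


# Hypothetical monthly unit sales and the corresponding forecasts.
actual = [100.0, 200.0, 400.0]
forecast = [110.0, 180.0, 400.0]

print(mae(actual, forecast))
print(rmse(actual, forecast))
print(mape(actual, forecast))
```

MAPE is scale-free, which makes it convenient for comparing series of very different volumes, but it breaks down when actual values are at or near zero, one reason longer-horizon work often considers additional metrics.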

Qualifications :

  • Education : Bachelor's or Master's degree in Data Science, Computer Science, Statistics, Applied Mathematics, or a closely related quantitative field.
  • Experience : 2 to 5 years of hands-on experience in a Data Scientist or Machine Learning Engineer role.
  • Proven experience developing and deploying machine learning models in a production environment.
  • Demonstrated experience in long-range demand and sales forecasting.
  • Significant hands-on experience with PySpark for large-scale data processing and machine learning.
  • Extensive practical experience working with the Databricks platform, including notebooks, jobs, and ML capabilities.

Technical Skills :

  • Optimization (Must) : Strong understanding of nonlinear optimization, constrained minimization, and time-series simulation.
  • Machine Learning & Forecasting : Strong expertise in building and deploying ML models, especially for long-range demand and sales forecasting.
  • Time Series Analysis : Proficiency in advanced time series techniques, including probabilistic and hierarchical forecasting.
  • Databricks Platform : Skilled in using Databricks for model development, training, and deployment in a production environment.
  • MLOps : Familiarity with model deployment, monitoring, and automation workflows using AWS SageMaker.
  • Domain Knowledge : Experience working with complex datasets, preferably in the automotive or manufacturing sector.
  • Expert proficiency in PySpark.
  • Strong proficiency in Python and SQL.
  • Experience with machine learning libraries compatible with PySpark (e.g., MLlib, or integrating other libraries).
  • Experience with advanced time series forecasting techniques and their implementation.
  • Experience with distributed computing concepts and optimization techniques relevant to PySpark.
  • Hands-on experience with a major cloud provider (Azure, AWS, or GCP) in the context of using Databricks.
  • Familiarity with MLOps concepts and tools used in a Databricks environment.
  • Experience with data visualization tools.
  • Analytical skills with a deep understanding of machine learning algorithms and their application to forecasting.
  • Ability to troubleshoot and solve complex technical problems related to big data and machine learning workflows.

Preferred Qualifications :

  • Experience with specific long-range forecasting methodologies and libraries used in a distributed environment.
  • Experience with real-time or streaming data processing using PySpark for near-term forecasting components that might complement long-range models.
  • Familiarity with automotive data types relevant to long-range forecasting (e.g., economic indicators affecting car sales, long-term market trends).
  • Experience with distributed version control systems (e.g., Git).
  • Knowledge of agile development methodologies.

Soft Skills :

  • Collaboration : Ability to work effectively as part of a technical team.
  • Communication : Clear and concise communication of technical details and forecasting results.
  • Problem-Solving : Ability to tackle complex technical challenges and find efficient solutions.
  • Learning Agility : Eagerness to learn and adapt to new technologies and methodologies within the PySpark / Databricks ecosystem and advancements in long-range forecasting.
  • Ability to understand business needs related to long-term planning.
  • (ref : hirist.tech)