Talent.com
Data Scientist - Python/PySpark
Data Scientist - Python/PySparkBOT Consulting • Jaipur
No longer accepting applications
Data Scientist - Python / PySpark

Data Scientist - Python / PySpark

BOT Consulting • Jaipur
30+ days ago
Job description

About the job

We are seeking a skilled Data Scientist with 2 to 5 years of experience, specializing in Machine Learning, PySpark, and Databricks, with a proven track record in long-range demand and sales forecasting. This role is crucial for the development and implementation of an automotive OEMs next-generation Intelligent Forecast Application.

The position will involve building, optimising, and deploying large-scale machine learning models for complex, long-term forecasting challenges using distributed computing frameworks, specifically PySpark on the Databricks platform. The work will directly support strategic decision-making across the automotive value chain, including areas like long-term demand planning, production scheduling, and inventory optimization.

The ideal candidate will have hands-on experience developing and deploying ML models for forecasting, particularly long-range predictions, in a production environment using PySpark and Databricks.

This role requires strong technical skills in machine learning, big data processing, and time series forecasting, combined with the ability to work effectively within a technical team to deliver robust and scalable long-range forecasting solutions.

Roles & Responsibilities :

  • Machine Learning Model Development & Implementation for Long-Range Forecasting : Design, develop, and implement scalable and accurate machine learning models specifically for long-range demand and sales forecasting challenges.
  • Apply advanced time series analysis techniques and integrate them with machine learning models leveraging PySpark for data processing and model training on large datasets within the Databricks environment.
  • Implement probabilistic forecasting methods using PySpark to capture uncertainty in long-range predictions.
  • Develop robust solutions for hierarchical and grouped long-range forecasting on distributed data.
  • Data Processing and Feature Engineering with PySpark : Build and optimize large-scale data pipelines for ingesting, cleaning, transforming, and engineering features relevant to long-range forecasting from diverse, complex automotive datasets using PySpark on Databricks.
  • Deployment and MLOps on Databricks : Develop and implement robust code for model training, inference, and deployment of long-range forecasting models directly within the Databricks platform.
  • Apply MLOps principles compatible with Databricks workflows for model versioning, monitoring, retraining, and managing the lifecycle of long-range ML forecasting models in production.
  • Collaborate with Data Engineering and IT Operations to ensure seamless deployment and operational efficiency of the forecasting application on Databricks.
  • Performance Evaluation & Optimization : Evaluate long-range forecasting model performance using relevant metrics (e.g., MAE, RMSE, MAPE, considering metrics suitable for longer horizons) and optimize models and data processing pipelines for improved accuracy and efficiency within the PySpark / Databricks ecosystem.
  • Technical Collaboration : Work effectively as part of a technical team, collaborating with other data scientists, data engineers, and software developers to integrate ML long-range forecasting solutions into the broader forecasting application built on Databricks.
  • Communicate technical details and forecasting results effectively within the technical team.

Qualifications :

  • Education : Bachelor's or Master's degree in Data Science, Computer Science, Statistics, Applied Mathematics, or a closely related quantitative field.
  • Experience : 2 to 5 years of hands-on experience in a Data Scientist or Machine Learning Engineer role.
  • Proven experience developing and deploying machine learning models in a production environment.
  • Demonstrated experience in long-range demand and sales forecasting.
  • Significant hands-on experience with PySpark for large-scale data processing and machine learning.
  • Extensive practical experience working with the Databricks platform, including notebooks, jobs, and ML capabilities.
  • Technical Skills :

  • Optimization (Must) : Strong understanding of nonlinear optimization, constrained minimization, and time-series simulation
  • Machine Learning & Forecasting : Strong expertise in building and deploying ML models, especially for long-range demand and sales forecasting.
  • Time Series Analysis : Proficiency in advanced time series techniques, including probabilistic and hierarchical forecasting.
  • Databricks Platform : Skilled in using Databricks for model development, training, and deployment in a production environment.
  • MLOps : Familiarity with model deployment, monitoring, and automation workflows using AWS SageMaker.
  • Domain Knowledge : Experience working with complex datasets, preferably in the automotive or manufacturing sector.
  • Expert proficiency in PySpark.
  • Strong proficiency in Python and SQL.
  • Experience with machine learning libraries compatible with PySpark (e.g., MLlib, or integrating other libraries).
  • Experience with advanced time series forecasting techniques and their implementation.
  • Experience with distributed computing concepts and optimization techniques relevant to PySpark.
  • Hands-on experience with a major cloud provider (Azure, AWS, or GCP) in the context of using Databricks.
  • Familiarity with MLOps concepts and tools used in a Databricks environment.
  • Experience with data visualization tools.
  • Analytical skills with a deep understanding of machine learning algorithms and their application to forecasting.
  • Ability to troubleshoot and solve complex technical problems related to big data and machine learning workflows
  • Preferred Qualifications :

  • Experience with specific long-range forecasting methodologies and libraries used in a distributed environment
  • Experience with real-time or streaming data processing using PySpark for near-term forecasting components that might complement long-range models.
  • Familiarity with automotive data types relevant to long-range forecasting (e.g., economic indicators affecting car sales, long-term market trends).
  • Experience with distributed version control systems (e.g., Git).
  • Knowledge of agile development methodologies
  • Soft Skills :

  • Collaboration : Ability to work effectively as part of a technical team.
  • Communication : Clear and concise communication of technical details and forecasting results.
  • Problem-Solving : Ability to tackle complex technical challenges and find efficient solutions.
  • Learning Agility : Eagerness to learn and adapt to new technologies and methodologies within the PySpark / Databricks ecosystem and advancements in long-range forecasting.
  • Ability to understand business needs related to long-term planning.
  • (ref : hirist.tech)

    Create a job alert for this search

    Data Scientist • Jaipur

    Related jobs
    Data Scientist

    Data Scientist

    Fulcrum Digital Inc • Jaipur, IN
    We are looking for a skilled and hands-on.ML algorithms to advanced deep learning and Generative AI systems.The ideal candidate brings a strong foundation in classification, anomaly detection, and ...Show more
    Last updated: 14 days ago • Promoted
    Data Scientist

    Data Scientist

    Qubryx • Jaipur, IN
    Senior Data Scientist (Remote – India) – Predictive Modeling & Machine Learning.We are looking for a highly skilled.India-based team in a remote capacity. This role focuses on building and deploying...Show more
    Last updated: 30+ days ago • Promoted
    Data Scientist - DSP Audience Optimization

    Data Scientist - DSP Audience Optimization

    The Trade Desk • Jaipur, IN
    Data Scientist – Digital Advertising & Marketing Technology.We are a global technology team responsible for building and managing advanced advertising platforms that power large-scale digital marke...Show more
    Last updated: 21 days ago • Promoted
    Data Scientist

    Data Scientist

    Lingaro • Jaipur, IN
    Data Scientist - Consumer Analytics.Growth through diversity, equity, and inclusion.As an ethical business, we do what is right — including ensuring equal opportunities and fostering a safe, respec...Show more
    Last updated: 30+ days ago • Promoted
    Multiple SAP Positions (SAP HANA Cloud, Big Data, Python, Datasphere) Remote, India

    Multiple SAP Positions (SAP HANA Cloud, Big Data, Python, Datasphere) Remote, India

    MIB IT Solutions • jaipur, rajasthan, in
    Remote
    We’re Hiring Multiple SAP Roles for a New Implementation Project!.Availability : Immediate to 60 Days.We’re thrilled to announce multiple openings across SAP competencies for an exciting implementat...Show more
    Last updated: 4 hours ago • Promoted • New!
    Data Scientist

    Data Scientist

    Recro • Jaipur, IN
    We’re seeking a highly skilled, hands-on Data Scientist with 4–10 years of experience in applied AI / ML to join our fast-paced team. This role requires deep expertise in transformer architectures and...Show more
    Last updated: 30+ days ago • Promoted
    Data Scientist

    Data Scientist

    72 Dragons • Jaipur, IN
    Dragons is a global worldwide production company as well as a set of global services in production, social media, software, data science, design, and curation. We are based and work in China, India,...Show more
    Last updated: 30+ days ago • Promoted
    Data Scientist (Risk strategy, SME)

    Data Scientist (Risk strategy, SME)

    bluCognition • Jaipur, IN
    Cognition is an AI / ML based start-up specializing in risk analytics, data conversion and data enrichment capabilities.Founded in 2017, by some very named senior professionals from the financial ser...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Scientist

    Senior Data Scientist

    Lingaro • Jaipur, IN
    Data Scientist - GenAI Expert / Tech Lead.We are looking for a highly skilled and experienced GenAI Expert / Tech Lead with solid expertise in Generative AI (GenAI) to guide our teams & projects in bui...Show more
    Last updated: 20 days ago • Promoted
    Sr. Data Scientist

    Sr. Data Scientist

    Hyper Lychee Labs • Jaipur, IN
    Hyper Lychee Labs is a dynamic IT services and consulting company delivering innovative solutions to a diverse client base, including Fortune 500 companies, leading startups, and government entitie...Show more
    Last updated: 5 days ago • Promoted
    Senior Python Data Engineer

    Senior Python Data Engineer

    iVoyant • jaipur, rajasthan, in
    Join a dynamic engineering team working on a high-impact tax reporting platform for the 2025 fiscal season.The core goal is to modernize and significantly accelerate the generation of Excel-based r...Show more
    Last updated: 22 days ago • Promoted
    Data Engineer

    Data Engineer

    IntraEdge • Jaipur, IN
    We are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our growing data team. You will be responsible for building scalable and reli...Show more
    Last updated: 30+ days ago • Promoted
    Senior Data Engineer - Data Acquisition

    Senior Data Engineer - Data Acquisition

    InfoBeans • jaipur, rajasthan, in
    We are seeking a highly skilled.Senior Data Engineer – Data Acquisition (ODS).The ideal candidate will have extensive hands-on experience in building and optimizing data ingestion and transformatio...Show more
    Last updated: 28 days ago • Promoted
    Python Data Engineer

    Python Data Engineer

    TechKareer • jaipur, rajasthan, in
    We're helping one of the biggest debt consulting companies in the US expand their IT team.If you get selected, you will be directly working with them. Please apply if you match with all of the Gener...Show more
    Last updated: 4 hours ago • Promoted • New!
    Data Scientist AI / ML

    Data Scientist AI / ML

    Somnetics (Som Imaging Informatics Pvt. Ltd.) • Jaipur, IN
    Machine Learning, Deep Learning, NLP, and Computer Vision.TensorFlow / PyTorch, scikit-learn).Generative AI, LangChain, RAG, LLMs. Docker, Kubernetes, MLflow, Git, Azure.Strong analytical, problem-sol...Show more
    Last updated: 15 days ago • Promoted
    Data Scientist

    Data Scientist

    GEFEN OPTICAL, LLC • Jaipur, IN
    Do you live and breathe data, keywords, and performance metrics? We are hiring an Amazon PPC Specialist who can turn data into growth. If you know how to make campaigns profitable, scalable, and per...Show more
    Last updated: 8 days ago • Promoted
    Senior Big Data Developer_ Exp : 6+ Years

    Senior Big Data Developer_ Exp : 6+ Years

    Atyeti Inc • Jaipur, IN
    Analyze, organize and process raw data using Bigdata technologies and Spark.Perform data validation, cleaning and transformation using big data technologies, Spark. Ingest and manage data in HDFS / Hi...Show more
    Last updated: 22 hours ago • Promoted • New!
    Data Scientist

    Data Scientist

    Danone • Jaipur, IN
    We are looking for a professional who can drive.Machine Learning models for our.The ideal candidate will combine strong data science capabilities with effective communication skills to deliver exce...Show more
    Last updated: 20 days ago • Promoted