Description
Ciklum is looking for an Expert Data Scientist to join our team full-time in India.
We are a custom product engineering company that supports both multinational organizations and scaling startups to solve their most complex business challenges. With a global team of over 4,000 highly skilled developers, consultants, analysts and product owners, we engineer technology that redefines industries and shapes the way people live.
About the role :
As an Expert Data Scientist, become a part of a cross-functional development team engineering experiences of tomorrow.
Responsibilities
- Development of prototype solutions, mathematical models, algorithms, machine learning techniques, and robust analytics to support analytic insights and visualization of complex data sets
- Work on exploratory data analysis so you can navigate a dataset and come out with broad conclusions based on initial appraisals
- Provide optimization recommendations that drive KPIs established by product, marketing, operations, PR teams, and others
- Interact with engineering teams and ensure that solutions meet customer requirements in terms of functionality, performance, availability, scalability, and reliability
- Work directly with business analysts and data engineers to understand and support their use cases
- Work with stakeholders throughout the organization to identify opportunities for leveraging company data to drive business solutions
- Drive innovation by exploring new experimentation methods and statistical techniques that could sharpen or speed up our product decision-making processes
- Cross-train other team members on technologies being developed, while also continuously learning new technologies from other team members
- Contribute to Unit activities and community building, participate in conferences, and promote excellence and best practices
- Support marketing and sales activities, customer meetings, and digital services by directly supporting sales opportunities and providing thought leadership and content creation for the service
Requirements
We know that sometimes, you can’t tick every box. We would still love to hear from you if you think you’re a good fit!
General technical requirements :
- BSc, MSc, or PhD in Mathematics, Statistics, Computer Science, Engineering, Operations Research, Econometrics, or related fields
- Strong knowledge of Probability Theory, Statistics, and a deep understanding of the Mathematics behind Machine Learning
- Proficiency with CRISP-ML(Q) or TDSP methodologies for addressing commercial problems through data science solutions
- Hands-on experience with various machine learning techniques, including but not limited to : regression, classification, clustering, and dimensionality reduction
- Proficiency in Python for developing machine learning models and conducting statistical analyses
- Strong understanding of data visualization tools and techniques (e.g., Python libraries such as Matplotlib, Seaborn, Plotly) and the ability to present data effectively
Specific technical requirements :
- Proficiency in SQL for data processing, data manipulation, sampling, and reporting
- Experience working with imbalanced datasets and applying appropriate techniques
- Experience with time series data, including preprocessing, feature engineering, and forecasting
- Experience with outlier detection and anomaly detection
- Experience working with various data types : text, image, and video data
- Familiarity with AI / ML cloud implementations (AWS, Azure, GCP) and cloud-based AI / ML services (e.g., Amazon SageMaker, Azure ML)
Domain experience :
- Experience with analyzing medical signals and images
- Expertise in building predictive models for patient outcomes, disease progression, readmissions, and population health risks
- Experience in extracting insights from clinical notes, medical literature, and patient-reported data using NLP and text mining techniques
- Familiarity with survival or time-to-event analysis
- Expertise in designing and analyzing data from clinical trials or research studies
- Experience in identifying causal relationships between treatments and outcomes, using methods such as propensity score matching or instrumental variable techniques
- Understanding of healthcare regulations and standards like HIPAA, GDPR (for healthcare data), and FDA regulations for medical devices and AI in healthcare
- Expertise in handling sensitive healthcare data in a secure, compliant way, understanding the complexities of patient consent, de-identification, and data sharing
- Familiarity with decentralized data models such as federated learning to build models without transferring patient data across institutions
- Knowledge of interoperability standards such as HL7, SNOMED, FHIR, or DICOM
- Ability to work with clinicians, researchers, health administrators, and policy makers to understand problems and translate data into actionable healthcare insights
Good to have skills :
- Experience with MLOps, including integration of machine learning pipelines into production environments, Docker, and containerization / orchestration (e.g., Kubernetes)
- Experience in deep learning development using TensorFlow or PyTorch libraries
- Experience with Large Language Models (LLMs) and Generative AI applications
- Advanced SQL proficiency, with experience in MS SQL Server or PostgreSQL
- Familiarity with platforms like Databricks and Snowflake for data engineering and analytics
- Experience working with Big Data technologies (e.g., Hadoop, Apache Spark)
- Familiarity with NoSQL databases (e.g., columnar or graph databases like Cassandra, Neo4j)
Business-related requirements :
- Proven experience in developing data science solutions that drive measurable business impact, with a strong track record of end-to-end project execution
- Ability to effectively translate business problems into data science problems and create solutions from scratch using machine learning and statistical methods
- Excellent project management and time management skills, with the ability to manage complex, detailed work and effectively communicate progress and results to stakeholders at all levels
Desirable
- Research experience with peer-reviewed publications
- Recognized achievements in data science competitions, such as Kaggle
- Certifications in cloud-based machine learning services (AWS, Azure, GCP)
What's in it for you
- Care : your mental and physical health is our priority. We ensure comprehensive company-paid medical insurance, as well as financial and legal consultation
- Tailored education path : boost your skills and knowledge with our regular internal events (meetups, conferences, workshops), Udemy licence, language courses and company-paid certifications
- Growth environment : share your experience and level up your expertise with a community of skilled professionals, locally and globally
- Flexibility : hybrid work mode at Chennai or Pune
- Opportunities : we value our specialists and always find the best options for them. Our Resourcing Team helps change a project if needed to help you grow, excel professionally and fulfil your potential
- Global impact : work on large-scale projects that redefine industries with international and fast-growing clients
- Welcoming environment : feel empowered with a friendly team, open-door policy, informal atmosphere within the company and regular team-building events