This individual will be a key contributor in all aspects of the MDAS(Model Delivery as a Service) and must be willing to work in agile project framework. One of the key cornerstones of MDAS personnel is the ability to adapt and quickly learn the business needs throughout the entire modeling implementation process. The incumbent would lead all quality assurance and testing effort that the team works on and must be able to provide guidance to junior team members. The incumbent must ensure that the documentation and archiving of all relevant artifacts of the validation done is maintained in an orderly manner for audit purpose.
Responsibilities :
- Import, clean, transform and validate data from multiple sources and systems (Teradata, Oracle, Cloud DB) in preparation for analytics and / or modeling
- Have an understanding of Python, PySpark and Hadoop with a willingness to become an expert in Big Data and Machine Learning processes.
- Leverage a variety of analytical and statistical applications (SAS, SQL, Python, PySpark, JAVA, Scala) to describe, analyze, and validate trends in large complex data sets.
- Build and develop Proof-of-Concept machine learning outlier detection systems to run in Cloud Platform using Python / PySpark or JAVA / Scala.
- Oversee and validate production processes as they are being developed and implemented.
Qualifications
6+ years of experience doing hands-on data analysis.Technical expertise regarding data models, data analysis, and segmentation techniquesSelf-sufficiency in querying and extracting data from Cloud based open source platforms like Data Bricks and Snowflake .Strong analytical skills with the ability to collect, organize, analyze, and disseminate significant amounts of information with attention to detail and accuracyAbility to present fact-based recommendations in a clear, logical, and concise way 'tell a story' with dataSuperior written and oral communication skills ability to communicate effectively with all levels of management and partners from a variety of business functionsBS in Mathematics, Economics, Computer Science, Information Management, Statistics, Engineering or related quantitative discipline preferredMust be proactive, results driven and have a proven track record of execution
Skills Required
Sas, Sql, snowflake , Java, Hadoop, Pyspark, Machine Learning, Python, Big Data, Scala, Teradata