Reproducible Data Analysis in Python | Initium Lab | Exploratory Arm of Initium Media

Reproducible Data Analysis in Python

Hong Kong
Unit 1907, Prosperity Millennia Plaza, 663 King's Road, Hong Kong
Chris Choy, Senior Computational Scientist at ClusterTech

Data analysis consists of multiple steps: data cleansing, modelling and reporting. Reproducible analysis consists of a set of codes that execute the data pipeline from raw data to reports. Reproducibility allows one to easily trace the details of your analysis, adapt to changes and provide updated analysis when new data arrives.

Data Reproducible Research Python

About the Speaker

Chris is currently a Senior Computational Scientist at ClusterTech. He has worked on tailored data mining solutions for clients in the banking and retail industry. Prior to joining ClusterTech in 2014, he earned his Dphil in Oxford with a research focus in high dimensional statistics.