Kaggle Kernels - Statistical Modeling, Causal Inference, and Social Science

lkfitz's bookmarks 2016-11-15

Anthony Goldbloom writes:

In late August, Kaggle launched an open data platform where data scientists can share data sets. In the first few months, our members have shared over 300 data sets on topics ranging from election polls to EEG brainwave data. It’s only a few months old, but it’s already a rich repository for interesting data sets.

It’s also a nice place to share reproducible data science. We have built a tool called Kaggle Kernels, which allows data scientists and statisticians to share notebooks and scripts in Python or R on top of the data. If you find analysis you want to extend, you can “fork it” which gives you a reproducible version without going through the pain of replicating the author’s environment. It’s useful for learning new techniques (by being able to fork and play with other’s code), to share your side project with a large community and to draw attention to your research and store it in a way that can be easily reproduced.

He adds:

We don’t support Stan yet but we inevitably will.

Sooner rather than later, I hope!

The post Kaggle Kernels appeared first on Statistical Modeling, Causal Inference, and Social Science.