Web Picks (week of 15 June 2015)

Every two weeks, we find the most interesting data science links from around the web and collect them in Data Science Briefings, the DataMiningApps newsletter. Subscribe now for free if you want to be the first to get up to speed on interesting resources.


  • Jupyter is getting traction
    We’ve mentioned Jupyter before in our newsletter as the successor to IPython. A lot has happened in a short span of time, and the project now supports a wealth of languages, including R, Lua, Julia, and of course Python.
  • SparkR preview in RStudio
    No worries for you RStudio users, as the RStudio blog already describes how you can get up and running with Spark directly within your favored development environment.
  • Rodeo has gotten an update as well
    Another tool we mentioned when it was first released has since received many updates. Rodeo is a data-centric IDE for Python, and it now supports Spark out of the box as well.
  • Extending “Let It Go” with LSTM
    A fun one: recurrent neural networks get all the hype these days, and this page shows how a recurrent neural network (with Long Short-Term Memory) generates the next thousand bytes of the popular song “Let It Go”.
  • MarI/O – Machine Learning for Video Games
    Another fun entry to close the list: this video shows how a neural network can be evolved to learn to play Mario using NEAT (NeuroEvolution of Augmenting Topologies, which combines neural networks with a training method based on genetic algorithms). We wonder how well it generalizes to unseen instances (i.e. levels), but the video is interesting nonetheless.