Introducing Schrute.jl: The Office Transcripts Data Set for Julia

By: Brad Lindblad
LinkedIn | Github | Blog | Twitter

In an effort to broaden my horizons, I’ve ported the popular Schrute package, already available for both R and Python, to the Julia language.

The package has a single function, which returns a DataFrame containing the complete transcripts of The Office: 55,130 lines across 9 seasons. There are also attribute fields such as writer, director, and IMDb score for each episode.

For a complete rundown (Charles Miner), see the package tutorial, or view the source code at the GitHub repo.
