NLP Pipeline Manager
Natural Language Processing (NLP) has a lot of very frustrating parts. In this post, I introduce a library
I wrote, and how I hope it makes NLP suck less.
What is a Monte Carlo Simulation (Part 1)
The first in a series of posts dedicated to exploring the power and flexibility of Monte Carlo techniques. In this post we ask, "what the heck is a Monte Carlo Simulation anyway?"
Recommendation Engines for Dummies
A look into how collaborative filtering works for recommendations, with some Python code to build your own from scratch. Targeted for those without deep techincal knowledge of data science.
KMeans Color Compressor
Using a clustering algorithm called KMeans, I stylize images by forcing pixels into groups of colors.
Analysis Tree Maker
This C++ code is used to convert an unmanageable 60+TB of data into a smaller, but still usable data structure for extracting physics results.
zPlot - A ROOT Extension
A C++ class to make managing plot-like objects within CERN's ROOT program simpler, more user-friendly, and more consistent.
The Drunken Walker(s)
A visualization of the classic "drunken walk" physics thought-experiment. A mix of random numbers and Jackson Pollock.
Input the statistics of two teams of baseball players, then simulate games to your heart's content.