My Mission

Spread the word: Data doesn't speak for itself; it needs an interpreter.

​​Why is the average age of death for male rappers under 30? Why are the best scoring schools the smallest ones? Are large earthquakes on the rise? Do dead salmon have brain activity when shown photographs? Are the Sophomore Slump and the Sport Illustrated Jinx real or imaginary? Do drugs for relaxation help students score higher on the SAT? Why does punishment seem to work better than reward? Why are movie sequels rarely as good as the originals? These are the types of questions that data scientists should be well positioned to answer, but knowing the specific answers isn’t as important as being familiar with the underlying principles and pitfalls which lead less careful thinkers astray.


Businesses call themselves "data driven" and think they know what data is telling them ("up is up"). However, many are not analyzing things in a scientifically valid way and are setting themselves up to be duped by data. My goal is to help train the next generation of data scientists and managers to avoid the pitfalls, whether it's through my book or by directly speaking to them about what I've learned. Most books contain success stories, but mine is mostly filled with "failure stories", which should be more instructive. Data science works, but only if you do it right.



 

  • Co-Author of "The 9 Pitfalls of Data Science" (Oxford University Press)
  • Master of Information and Data Science degree from UC Berkeley

Some of my greatest hits

  • Redesigned the statistics for A/B testing and the "auto-tester" used for optimization experiments at Oversee.net, one year leading to a lift in revenue per visitor of over 40%.
  • Used regression analysis to improve Oversee's domain buying profit by over 100% and optimized renewal strategies to save about $30k per month.
  • Scored a perfect 10 on the probability / statistics actuarial exam.




  • Led an effort to objectively rank  MMA fighters for the first time at Sherdog.com.