Tag Archives: Data Vizualization

Exploratory Analysis – When to Choose R, Python, Tableau or a Combination

Not all data analysis tools are created equal.

Recently, I started looking into data sets to compete in Go Code Colorado (check it out if you live in CO). The problem with such diversity in data sets is finding a way to quickly visualize the data and do exploratory analysis. While tools like Tableau make data visualization extremely easy, the data isn’t always properly formatted to be easily consumed. Here’s are a few tips to help speed up your exploratory data analysis!

We’ll use data from two sources to aid with this example:

Picking the right tool

Always be able to answer the following before choosing a tool:

Continue reading

George Washington as a Constitutional Word Cloud

Is George Washington better looking on the dollar bill or represented by a word cloud built with the text of The Constitution of the USA?

A colleague recently asked me that exact question. If you want to be taken seriously in the data science world, you better be able to answer something like this!

I decided that it would be fun to show off a Python package by Andreas Mueller called word_cloud (here) to make a fun image with the text of the Constitution and an image of one of the Founding Fathers.

I must warn you, word clouds are like pie charts people like the way they look but clouds don’t provide much information. That said, this package is really neat because it allows you to easily turn text into images utilizing masks, colors, and numpy!

I’ll keep this post short, what you want to do is simple:

  1. Select an image which you would like to mimic in both color and shape
  2. Read your image into Python using numpy
  3. Read your text into Python using open() and read()
  4. Make your word cloud!

Continue reading