Every data visualization and information design project involves data cleaning and preparation. Love it or hate it (most people feel the later), 'data munging' is a necessary step and unique skill in the creation of good work. The Python library Pandas provides a terrific set of tools to do just that. The Wikipedia page for … Continue reading Using Pandas for quick data cleaning and preparation
Notes and Resources I gave a talk at last night’s CreateInTO event and it went really well! I very much appreciate that people came out on a cold, rainy October night. The talk was about my experiences in using Processing and P5.js for prototyping and data visualization. I used examples from two recent projects, Visualizing … Continue reading CreateInTo Talk Follow-up
Screencaps of a Work in Progress How much does the UN spend each year? Where does that money go? Who's spending it? Is it concentrated in a few countries or evenly spread around the world?? To answer some of these questions for myself, I started where I always start. With the data!! A quick search … Continue reading Visualizing $23 Billion in UN Agency Funding
The Travelling Salesperson Problem is a famous problem in computer science. The gist of it is as follows: "Given a list of cities and the distances between each pair of cities, what is the shortest possible route that visits each city and returns to the origin city?" It is an NP-hard problem in combinatorial optimization, … Continue reading Visualizing a Genetic Algorithm Attempting to Solve the Travelling Salesperson Problem
What are the limits of small multiples? At what point do the charts lose their individual meaning and blend back into a single, collective image/pattern? This is a quick experiment using bar graphs with random values to test the visual limits of small multiples.
Came out of a tweet from @phillipadsmith about an infographic. Got me thinking about the difference between an infographic and data vis. Not new territory but interesting to think about nonetheless. Edit: A simpler version of the graphic above. Made with Paper
I recently started using Processing (processing.org) at the CBC to visualize the dependencies of the content areas on projects being built by Media Ops & Technology (MO&T). Roughly speaking, MO&T builds out platform related projects and the content areas leverage the functionality of those projects to build out their sites. The previous post here was a first sketch … Continue reading Creating Data Visualizations for the CBC
The Top 50 Gawker Media Passwords - Digits - WSJGreat use and analysis of the Gawker password database by Zach Seward of the WSJ.com.
This image,Tsu-20041226-005853UTC, was created shortly after the Christmas 2004 tsunami in the Indian Ocean. It was made using satellite images and population density information for the affected countries in the Indian Ocean. I wrote a program in Processing to read this data and generate the black and white representation below. Lines representing people who died from … Continue reading Data Visualizations with Processing, TSU prints