Climate Change, Crowbars and Strikeouts

chartsnthings 2013-04-11

Summary:

Just over a week ago we published a graphic – more of an interactive short blog post without a blog, really – that accompanied Tyler Kepner’s piece about strikeouts for the Times’ 2013 baseball preview. The subject of both pieces was the steep increase in strikeouts across the board in the past decade: last year, ten Major League clubs set franchise records for strikeouts.

The fact Tyler came to us with was one he’d found on his own: 18 teams struck out at least 1,200 times last season; through 2005, there had never been a season in which more than two teams topped that total. Below, the first sketch, based on that stat – the number of teams with 1,200 strikeouts or more in a season going back to 1968:

image

That’s a compelling chart, but it’s also a little misleading because the league has expanded a few times and not all seasons are the same length.

Instead, Joe Ward and I thought about making small multiples of the teams and arranging them in a sort of histogram, sort of like my colleague Bill Marsh did with exit polls in 2008 and 2012.

Here are the first nine teams in alphabetical order, with the league average in grey:

image

We didn’t really care for these, and I complained about it to my colleague and cubicle-partner Alicia Desantis, who suggested I make it look like the climate change “hockey stick charts.” (FYI, The image below, one of the better ones from Wikipedia, is meant to convey the form, not wade into the “Hockey Stick controversy“ if you believe there is one.)

image

Here’s what the first R sketch of that idea looked like – every team’s average strikeouts per game per year. (Red is the league average.)

image

At this point, we had a chart we liked and the process went forward like many of our other projects do. However, there was a key difference with this one that’s worth mentioning - all the rest of the sketches, edits and and design improvements happened in a web browser. (More on this later.)

Here are a few successions of this chart, made using D3:

Checkin #2

image

Checkin #6

image

Checkin #22

image

As it appeared when published (Checkin #142)

image

UPDATE, now with more Voronoi, as per Mike’s request:

A few final technical notes worth mentioning:

Getting this data from baseball-reference.com requires a bit of scraping, and this project sold me for life on R’s XML package, which makes scraping fast and shamefully easy.

In the final project, there are three interactive charts and a table on the page, and they are all generated in D3 with just one data file. The whole chart form – line selection, tooltip, calculating averages – is easily abstracted out, and for the first time I felt some of the same sketching power in a browser that I’d seen only with R: the concept that if you can make one chart, you can make a hundred with the same effort. But with D3, the sketches are already in a browser and wired for interaction! From a development point of view, it felt tremendously powerful. (For many of you this might be obvious, but old habits die hard.)

Also, thanks to the open-source SVG Crowbar bookmarklet develope

Link:

http://chartsnthings.tumblr.com/post/47670081904

From feeds:

Statistics and Visualization » chartsnthings

Tags:

baseball r d3 joe ward data sketches

Date tagged:

04/11/2013, 18:45

Date published:

04/10/2013, 22:15