Colby Cosh had a good line a while back about the NHL and statistics (he was specifically talking about high event versus low event players, but the quote is of general application):
This will be the next stat revolution in hockey when the league finally realizes it already has the data–it just needs to present them to the fan. (Ideally, in some intelligible form that encourages number-crunching, as opposed to making it a bigger pain in the ass than rectal cancer.)
The NHL has presented more data to fans this year than ever before and they should be commended for that. They should be vehemently criticized though, for the fact that it’s harder to compile than ever. I used to have a spreadsheet with a VBA script that did most of the dirty work for me; it’s been rendered useless by the changes to the site. One of my favourite developments is an apparently new policy whereby they present different information on the event sheets and game sheets from game to game. For instance, they started presenting shots broken down by game state this year. It’s not compiled anywhere on their site (FYI: they seem to have difficulty compiling information correctly as well) so if you’re interested, you need to scrape it. Of course scraping it is a pain in the ass because you don’t know where it will be from game to game. As a source of anything more than general statistics, their site is borderline useless. I can’t believe that this would be that difficult to fix; hire a computer science student for a summer and turn him loose. They would do well to pay attention to MLB.com, who are currently making the Pitch f/x data available free of charge almost instantaneously, in an easy to grab format.
Read the rest of this entry »