• Hackathon 2.0

    by Tyler Dellow • December 14, 2012 • Hockey • 15 Comments

    The great thing about the Oilers is that even when there’s no hockey being played, they still do odd things on which we can comment.

    The Oilers have announced the second edition of their analytics competition, named “Hackathon” for reasons that aren’t entirely clear to me. It’s been well known for some time that the Oilers are getting into analytics, although it’s not entirely clear to me that they’re going about it in a sensible way or that they’re taking it particularly seriously and I’ve got my doubts that they’ll get much useful out of it.

    The competition that the Oilers are running has a pretty cool first prize – you get a behind the scenes game day experience at Rexall, including a chance to watch the game with Oilers management, which would be something, but you also get the opportunity to participate in the Oilers’ Analytics Group, which is really cool if you’re into this stuff. Inside the throbbing brain of the operation!

    Except…it seems like a really weird contest if the object is to identify talented analytics types and suck them into Team Oiler. A preliminary objection: anyone who’s seriously into this stuff is on the internet somewhere, writing and talking about it, unless he’s working for an NHL team or trying to produce a product to sell, which is almost certainly black box garbage (see: the PowerScout people). If these people are on the internet somewhere, one wonders why the OIlers wouldn’t just search the internet and bring them in. We know that they know how to search through blogs – they did some sort of a search when they were assembling this group and decided that Bruce McCurdy was the best analytics guy on the internet, I assume. If they want to expand the group, I’m not sure why they wouldn’t find the second best analytics guy on the internet and bring him in as Bruce’s junior?

    In other words, a contest seems a convoluted sort of a mechanism to identify talent. This would be true even if the contest made sense. The contest doesn’t make sense. Contestants will be graded on four different tasks:

    1. Predict next regular season’s points/game for the players listed in appendix A.

    2. Predict next season’s even strength save percentage of the goaltenders listed in appendix A

    3. Predict the goal differential per regular season game ((goals for less goals against) divided by games played) for all thirty teams for the upcoming season.

    4. Conduct a predictive analysis of your choice on some dimension of potential value to the Oilers. The analysis must be testable in the upcoming season and judged on its difficulty, accuracy, clarity, and value

    The first two of these tasks are ridiculous. To start with, next year’s season is going to be an abortion of a season, with some guys coming into camp in shape and other guys coming in fat and happy. It won’t be like a normal NHL season in terms of preparation or practice. Mark Cuban talked about this last year and basically said that the lockout shortened season polluted all of the data. This will be true in the NHL as well. Nobody has any sort of a model that can account for this – people are going to guess and someone will get lucky.

    Second, the short sample sizes are going to skew everything like crazy. There are going to be, what, 50 games? A starting goalie will play 35 or 40? Crazy things happen in small samples with goalies and, truthfully, a 60 game workload is a small sample for a goalie. This should be intuitive to the Oilers – Nikolai Khabibulin reeled off a .919 save percentage in 2008-09 for Chicago, his best save percentage since the 2001-02 season. The Oilers signed him for four years, ignoring all the bad history, and now they have Taylor Hall, RNH and Nail Yakupov. These things are connected. It’s a silly thing to test people on, particularly in a short year.

    The same concerns exist with respect to skaters and points/game. A short season will skew things and some guys will have amazing years and some guys will have terrible years. I’m not even going to get into whether or not the Oilers should be that worried about player points/game – outside of some narrow issues with respect to timing of contracts, I don’t think that they should – but just the problem with doing any sort of projection of a short season, with all that entails.

    A repeatable ability to project goal difference might be a useful thing, although again, there are going to be problems due to the short season. In addition, in the absence of a projection system (and someone can win this without a projection system), I’m not sure how repeatable any of this is. If you’re in pursuit of analytic genius, you’d think that you’d want to find someone who’s got a repeatable system for projecting this stuff, not a guy who wins the 6/49 one time.

    Which brings me to the last question:

    Some examples of submissions for question four might be:
    - predict how many man-games each team will lose to injury
    - predict the shot differential per game of each team
    - predict which players who have yet to play 10 games in the NHL will have the highest point total in the next season

    The Oilers are really hung up on predicting points. Is Tambo in a hockey pool or something?

    I’m not sure what kind of data they’re going to make available to people – hopefully someone tells me at some point – but the first question, while fascinating, strikes me as being awfully difficult to answer without some detailed medical data from around the league, something I suspect isn’t coming. Again, I’ve got my doubts about how useful the answer is anyway; I’d bet that the teams that tend to lead the league in man-games lost tend to be teams that suffer a bunch of torn ligaments and broken bones during the course of the season. Good luck predicting that. Shot differential per game? I’m not even sure how this is useful information, given that shot differential will be impacted by score effects. How would this info help the Oilers win?

    So, you know, while I applaud the effort, I’m not really sure that I think they’re going to get a whole lot that’s useful out of it. That said, if anyone reading this does enter it, feel free to drop me an email and let me know what kind of data they’re making available and any experiences you have with it. Anonymity guaranteed.

    Email Tyler Dellow at mc79hockey@gmail.com

    About Tyler Dellow

    15 Responses to Hackathon 2.0

    1. dawgbone
      December 14, 2012 at

      The information they give is a combination of relevant (events with skaters who were on the ice) to the ridiculous (the draft results for every year since 1963).

      But yeah, if their goal is to predict player points, then they are looking for something that doesn’t exist.

    2. Woodguy
      December 14, 2012 at

      If you register for the contest, you get access to the data.

      I dare you to do it.

      Actually, I double dog dare you.

      • Cooliopedia
        December 14, 2012 at

        Dellow enters contest = Kobayashi Maru

    3. December 14, 2012 at

      From the CA: “I have and will learn confidential or proprietary
      information relating to the past, existing and contemplated operations of RSC and the
      Edmonton Oilers Hockey Club (the “Oilers”), including but not limited to personal, business,
      hockey, financial, strategic, statistical and investment information about RSC and the Oilers,
      their respective employees, subsidiaries, parent companies or affiliates, and the websites
      containing the information in the Hackathon package (the “Websites”) (collectively, the
      “Confidential Information”).”

      That sounds pretty cool and interesting, but it seems the reality is mostly publicly available data with no insight as to the purpose of the task at hand. Assuming Appendix A is made available at some point, I can’t imagine putting a whole lot of thought into what is essentially guesswork.

      Has anyone spent any time pouring through the data that Man City made available? I haven’t, but that seemed like a slightly better way to engage fans and cast a net to potentially find someone with something useful to contribute.

      Perhaps someone will knock their socks off on the fourth question, but this contest seems to just be a bit of harmless fun.

    4. Pingback: Spectors Hockey | NHL Lockout (Day 90) Blog Beat – December 14, 2012.

    5. Saj
      December 14, 2012 at

      I understand it’s a short season, but some of the non-points estimates seem reasonable to me. As a potentially fun exercise, I suggest you come up with a better way to judge hockey analytics, and potentially start a pool where interested parties can enter.

    6. December 14, 2012 at

      I like that they essentially say, “Hey, even if you aren’t eligible to win, you can send your stuff too! No sendbacks.” Can we say “Free Analysis Grab?” Like you said, Tyler, go on the internet sometime, anytime, and you can find stuff that’s not only pretty good but also pretty well-vetted by other people that create good analysis.

    7. December 14, 2012 at

      Aren’t you entering, Tyler?

      • Tyler Dellow
        December 14, 2012 at

        Nah. The Oilers know who I am and read my site at least occasionally. If they want to talk to me, they have my email address.

        • Brad
          December 14, 2012 at

          How do you know they read you? I couldn’t imagine the hilarity if you won. Would you have the nads to call out their body of work to managements face?

        • Rhys J
          December 15, 2012 at

          Step 1. Clever pseudonym. I suggest an anagram like “Drew L. Tolley”

          Step 2. Fake email address.

          Step 3. Win.

          Step 4. Carefully explain that your win was a fluke and that you can’t accurately predict the random minutiae of a hockey season. You may have to speak slowly here.

          Step 5. Rub elbows with Oilers players anyways. Be gentle though, you don’t want to cause another fluke shoulder injury.

    8. Woodguy
      December 14, 2012 at

      OT,

      Tyler,

      You’ve won me over to more on your side re: Oiler profitability pre-lockout.

      After reading your various posts on it I talked to my “guy who knows a guy” and he couldn’t refute much.

      That being said, he mentioned a massive windfall from the Heritage Classic which was used to lower the bank note aggressively to reach a different type of debt class (which a cash call *may* have used to top up to reach a threshold.)

      I still think the cash flow from normal operations was meh given the investment, so its not like their money couldn’t have been doing better elsewhere, but it wasn’t the march through the desert that Laforge makes it out to be.

    9. Pingback: Edmonton Oilers Hockey Hackathon | The Sports Juice

    10. Pingback: The Edmonton Oilers look to gain an edge with analytics & hackathons - MasterMaq's Blog

    11. Pingback: Oil On Whyte's Weekend Link Extravaganza - Oil On Whyte - An Edmonton Oilers Fan Site - News, Blogs, Opinion and More

    Leave a Reply

    Your email address will not be published. Required fields are marked *