Welcome to NHL-ML-Draft

Blog -- Disclaimer, Data Sources, Content Licensing
2019-09-30 00:00:00 -- meta, data


The information provided on this website does not, and is not intended to, constitute advice and should not be acted on as such. Instead, all information, content, and materials available on this site are for general informational or experimental purposes only. Information on this website may not be the most up to date. This website contains links to third-party websites. Such links are only for the convenience of the reader, user, or browser; NHL ML Draft does not recommend or endorse the contents of the third-party sites.

Sources of Data:

Whether or not data is copyrightable is a bit of a gray area. The short version is that raw data is generally not copyrightable, but its presentation, the act of compilation, etc. probably are.

Many hockey data sites list their policy on usage of their content. Here are a few examples.

These days, we also have popular, pre-fabricated licenses written by experts to govern use of content.

For example, QuantHockey uses the Creative Commons Attribution + Noncommercial license, which, as the name suggests, means you can use the data, must name QuantHockey as the source, and should not make any money by presenting data from this source (the use of ads on QuantHockey itself then seems questionable, but I'm not sure enough of the details to say whether or not that's permitted by the license).

Everything on Wikipedia is licensed under the Creative Commons Attribution - ShareAlike license. This is similar to the above, but without the commercial usage restriction and with the added ShareAlike requirement that derived works be released under the same license. This site regards Wikipedia as a viable source of data for this reason. However, to be pedantic, we'd have to make sure all that data arrived there from sources with licenses/policies that permitted that.

Because we use Wikipedia for all pre-2019 per-player data, we get some biases in the data. For example, data about a 1st rounder is far more likely to be available on Wikipedia than data about a 7th rounder who never made the NHL. More on these biases in a later post.

For new data (e.g., the current draft year), I mostly hand-pick data from EliteProspects. EliteProspects doesn't appear to publish a usage policy, so I use the common-sense approach: don't take too much, and don't directly compete (I don't post prospect stats here, for example).

EliteProspects' league leaders data are also used (e.g.).

Here is a summary of the data sources used to generate ML draft's content:

Usage of Content:

In some sense, the content provided by ML draft is best viewed as a derived work of data provided by Wikipedia. Therefore, we must respect the way that data is licensed. Since the Creative Commons Attribution + ShareAlike license is copyleft, content on this website is also licensed under that license: