Nascent 2012 analytics notes

dwmyers · Aug 21, 2012

Where I discuss the need for weekly pbp data that we can use, before delving into the mystery that is Pro Football Reference's expected points curves.

http://codeandfootball.wordpress.com/2012/08/21/nascent-2012-analytics-questions/

Excerpt:

I’ve suggested to Neil Paine that they publish their EP model data separately from their play by play data. A blog post with 1st and ten, 2nd and ten, 3rd and ten curves would give those of us in the wild a fighting chance to figure out how PFR actually came by their numbers.

koolaid · Aug 21, 2012

Cool website. Very different,

Hoofbite · Aug 22, 2012

dwmyers;4678655 said:
Where I discuss the need for weekly pbp data that we can use, before delving into the mystery that is Pro Football Reference's expected points curves.

http://codeandfootball.wordpress.com/2012/08/21/nascent-2012-analytics-questions/

Excerpt:

I'm not really sure what the article you posted is talking about. Seems like there's some back dialog in the mix.

But

That is a kick *** site. Thanks for the post. There are some articles on there that are really interesting.

dwmyers · Aug 23, 2012

The article doesn't stand on its own. There are a host of issues that you have to be familiar with before things like "the value of a touchdown" makes sense..

That phrase is typically used in expected points models to mean the average value of a touchdown minus the value, on the field, of the average runback. Before 2011, that was about 26-27 yards. Typical values before 2011 were 6.3 to 6.4 points.

The PFR model appears to value the touchdown as 7 points. Further, with a model derived value of 6.97 points for first and 10 on the 1 yard line, they appear to be saying that the odds of a touchdown on the 1 exceeds 99%. That's not true. Their model has some issues.

Things like this make it worthwhile to suggest to those folks that they tell us how they came by their model.

Almost all the modern stats (EP, win probability) have to be derived by parsing play-by-play data into a spreadsheet or database, and then you analyze them for the kinds of results you want. Run success calculations require a set of play by play data. If you start doing run success calculations, and then folks start arguing about whether a running back is "done for" ..

It's going to be hard to read my site unless you spend a bit of time on the articles I wrote in August and September of last year. That was an amazingly productive period. A number of the results published then were original. A fair number of those articles were retweeted or linked to by Brian Burke.

And thank you all for the "thumbs up!"

D-

burmafrd · Aug 23, 2012

agree that there is no way its 99% for a TD if you get a first and 10 at the 1.

Now 85% or so I can believe. But no way its 99%

dwmyers · Aug 23, 2012

Some more stuff to look at.

Drive ending probabilities, from Keith Goldner's blog:

Screen+Shot+2012-08-01+at+11.03.51+AM.png

The article I reference this from is here:

http://www.drivebyfootball.com/2012/08/modifying-markov-model.html

This is first and ten data, Keith Goldner's original Markov Chain EP model..

Screen+shot+2011-05-11+at+1.15.59+PM.png

These are his newer models:

Screen+Shot+2012-08-01+at+12.37.32+PM.png

None of them have intercepts anywhere near 7.0 points.

D-

dwmyers · Aug 23, 2012

Just as a fyi, this follow up article (more a summary of the discussion above) has been retweeted and my site is exceptionally active..

http://codeandfootball.wordpress.co...pro-football-reference-expected-points-model/

Nascent 2012 analytics notes

dwmyers

Well-Known Member

koolaid

Drink Me

Hoofbite

Well-Known Member

dwmyers

Well-Known Member

burmafrd

Well-Known Member

dwmyers

Well-Known Member

dwmyers

Well-Known Member

Staff online

Members online

Latest posts

Forum statistics