Nascent 2012 analytics notes

dwmyers

Well-Known Member
Messages
2,373
Reaction score
522
Where I discuss the need for weekly pbp data that we can use, before delving into the mystery that is Pro Football Reference's expected points curves.

http://codeandfootball.wordpress.com/2012/08/21/nascent-2012-analytics-questions/

Excerpt:

I’ve suggested to Neil Paine that they publish their EP model data separately from their play by play data. A blog post with 1st and ten, 2nd and ten, 3rd and ten curves would give those of us in the wild a fighting chance to figure out how PFR actually came by their numbers.
 
dwmyers;4678655 said:
Where I discuss the need for weekly pbp data that we can use, before delving into the mystery that is Pro Football Reference's expected points curves.

http://codeandfootball.wordpress.com/2012/08/21/nascent-2012-analytics-questions/

Excerpt:

I'm not really sure what the article you posted is talking about. Seems like there's some back dialog in the mix.

But

That is a kick *** site. Thanks for the post. There are some articles on there that are really interesting.
 
The article doesn't stand on its own. There are a host of issues that you have to be familiar with before things like "the value of a touchdown" makes sense..

That phrase is typically used in expected points models to mean the average value of a touchdown minus the value, on the field, of the average runback. Before 2011, that was about 26-27 yards. Typical values before 2011 were 6.3 to 6.4 points.

The PFR model appears to value the touchdown as 7 points. Further, with a model derived value of 6.97 points for first and 10 on the 1 yard line, they appear to be saying that the odds of a touchdown on the 1 exceeds 99%. That's not true. Their model has some issues.

Things like this make it worthwhile to suggest to those folks that they tell us how they came by their model.

Almost all the modern stats (EP, win probability) have to be derived by parsing play-by-play data into a spreadsheet or database, and then you analyze them for the kinds of results you want. Run success calculations require a set of play by play data. If you start doing run success calculations, and then folks start arguing about whether a running back is "done for" ..

It's going to be hard to read my site unless you spend a bit of time on the articles I wrote in August and September of last year. That was an amazingly productive period. A number of the results published then were original. A fair number of those articles were retweeted or linked to by Brian Burke.

And thank you all for the "thumbs up!"

D-
 
agree that there is no way its 99% for a TD if you get a first and 10 at the 1.

Now 85% or so I can believe. But no way its 99%
 
Some more stuff to look at.

Drive ending probabilities, from Keith Goldner's blog:

Screen+Shot+2012-08-01+at+11.03.51+AM.png


The article I reference this from is here:

http://www.drivebyfootball.com/2012/08/modifying-markov-model.html

This is first and ten data, Keith Goldner's original Markov Chain EP model..

Screen+shot+2011-05-11+at+1.15.59+PM.png




These are his newer models:

Screen+Shot+2012-08-01+at+12.37.32+PM.png


None of them have intercepts anywhere near 7.0 points.

D-
 

Staff online

Latest posts

Forum statistics

Threads
474,003
Messages
14,505,792
Members
24,207
Latest member
TomGiantsfan
Back
Top