The Myth of 370 Carries.

SaltwaterServr

Blank Paper Offends Me
Messages
8,124
Reaction score
1
Forgot I had this site bookmarked: http://www.advancednflstats.com

Here's an article about running back "overuse". I see that term thrown around by the mediots of the 4-letter and in general football conversations. It's a good read and interesting how a stats-proud website doctored things to create a buzz about something that doesn't really exist, The 370 Carries Per Season Barrier.

http://www.advancednflstats.com/2008/07/drunkards-light-posts-and-myth-of-370.html

ul 27, 2008

Drunkards, Light Posts, and the Myth of 370

.fullpost{display:inline;} Running back overuse has been a hot topic in the NFL lately, partly because of Football Outsiders' promotion of their "Curse of 370" theory. In several articles in several outlets, including their annual Prospectus tome, they make the case that there is statistical proof that running backs suffer significant setbacks in the year following a season of very high carries. But a close examination reveals a different story. Is there really a curse of 370? Do running backs really suffer from overuse?


Football Outsiders says:

"A running back with 370 or more carries during the regular season will usually suffer either a major injury or a loss of effectiveness the following year, unless he is named Eric Dickerson.

Terrell Davis, Jamal Anderson, and Edgerrin James all blew out their knees. Earl Campbell, Jamal Lewis, and Eddie George went from legendary powerhouses to plodding, replacement-level players. Shaun Alexander struggled with foot injuries, and Curtis Martin had to retire. This is what happens when a running back is overworked to the point of having at least 370 carries during the regular season."While it's true that RBs with over 370 carries will probably suffer either an injury or a significant decline in performance the following year, the reason is not connected to overuse. What Football Outsiders calls the 'Curse of 370' is really due to:
  1. Normal RB injury rates
  2. Natural regression to the mean
  3. A statistical trick known as multiple endpoints
  4. (And this should go without saying, but the "unless he is named Eric Dickerson" constraint is silliness.)
Injury Rate Comparison

In the 25 RB seasons consisting of 370 or more carries between the years of 1980 and 2005, several of the RBs suffered injuries the following year. Only 14 of the 25 returned to start 14 or more games the following season. In their high carry year (which I'll call "year Y") the RBs averaged 15.8 game appearances, and 15.8 games started. But in the following year ("year Y+1"), they averaged only 13.0 appearances and 12.2 starts. That must be significant, right?

The question is, significant compared to what? What if that's the normal expected injury rate for all starting RBs? If you think about it, to reach 370+ carries, a RB must be healthy all season. Even without any overuse effect, we would naturally expect to see an increase in injury rates in the following year.

In retrospect, comparing starts or appearances in such a year to any other would distort any evaluation. This is what's known in statistics as a selection bias, and in this case it could be very significant.

We can still perform a valid statistical analysis however. We just need to compare the 370+ carry RBs with a control group. The comparison group was all 31 RBs who had a season of 344-369 carries between 1980 and 2005. (The lower limit of 344 carries was chosen because it produced the same number of cases as the 370+ group as of 2004. Since then there have been several more which were included in this analysis.)

Fortunately there is a statistical test perfectly suited to comparing the observed differences between the two groups of RBs. Based on sample sizes, differences between means, and standard deviation within each sample, the t-test calculates the probability that any apparent differences between two samples are due to random chance. (A t-test results in a p-value which is the probability that the observed difference is just due to chance. A p-value below 0.05 is considered statistically significant while a high p-value indicates the difference is not meaningful.) The table below lists each group's average games, games started, and the resulting p-values in their high-carry year and subsequent year.

.nobrtable br {display: none} table {border-collapse: collapse; border-width: 1px 1px 1px 1px; border-style: solid; } th {padding: 3px} td {text-align: center; padding: 3px;} #logocell {padding: 0px 3px 0px 3px; } #colorcol {background-color:#ffffe0}
Comparison of Games Played and Started for High-Carry RBs










G Year YG Year Y+1GS Year YGS Year Y+1370+ Group15.813.015.812.2344-369 Group15.814.015.412.6P-Value0.620.68


The differences are neither statistically significant nor practically significant. In other words, even if the sample sizes were enlarged and the differences became significant, the difference in games started between the two groups of RBs is only 0.4 starts and 1.0 appearances. RBs with 370 or more carries do not suffer any significant increase in injuries in the following year when compared to other starting RBs.

Regression to the Mean

The 370+ carry group of RBs declined in yards per carry (YPC) by an average of 0.5 YPC compared to a decline of 0.2 YPC by the 344-369 group. This is an apparently statistically significant difference, but is it due to overuse?

Consider why a RB is asked to carry the ball over 370 times. It's fairly uncommon, so several factors are probably contributing simultaneously. First, the RB himself was having a career year. He was probably performing at his athletic peak, and coaches were wisely calling his number often. His offensive line was very healthy and stacked with top blockers. Next, his team as a whole, including the defense, was likely having a very good year. Being ahead at the end of games means that running is a very attractive option because there is no risk of interception and it burns time off the clock. Additionally, his team's passing game might not have been one of the best, making running that much more appealing. And lastly, opposing run defenses were likely weaker than average. Many, if not all of these factors may contribute to peak carries and peak yardage by a RB.

What are the chances that those factors would all conspire in consecutive years? Linemen come and go, or get injured. Opponents change. Defenses change. Circumstances change. Why would we expect a RB to sustain two consecutive years of outlier performance? The answer is we shouldn't. Running backs with very high YPC will get lots of carries, but the factors that helped produce his high YPC stats are not permanent, and are far more likely to decline than improve.

If I'm right, we should see a regression to the mean in YPC for all RBs with peak seasons, not just very-high-carry RBs. The higher the peak, the larger the decline the following year. And that's exactly what we see in the data.


The graph above plots RB YPC in the high-carry year against the subsequent change in YPC. The blue points are the high-carry group, and the yellow points are the very-high-carry group. Note that there is in fact a very strong tendency for high YPC RBs to decline the following year, regardless of whether a RB exceeded 370 carries.

Very-high-carry RBs tend to have very high YPC stats, and they naturally suffer bigger declines the following season. 370+ carry RBs decline so much the following year simply because they peaked so high. This phenomenon is purely expected and not caused by overuse.

Statistical Trickery

Why did Football Outsiders pick 370 as the cutoff? I'll show you why in a moment, but for now I'm going to illustrate a common statistical trick sometimes known as multiple endpoints by proving a statistically significant relationship between two completely unrelated things. I picked an NFL stat as obscure and random as I could think of--% of punts out of bounds (%OOB).

Let's say I want to show how alphabetical order is directly related to this stat. I'll call my theory the "Curse of A through C" because punters whose first names start with an A, B, or C tend to kick the ball out of bounds far more often than other punters. In 2007 the A - C punters averaged 15% of their kicks out of bounds compared to only 10% for D - Z punters. In fact, the relationship is statistically significant (at p=0.02) despite the small sample size. So alphabetical order is clearly related to punting out of bounds!

Actually, what I did was sort the list of punters in alphabetical order, and then scanned down the column of %OOB. I picked the spot on the list that was most favorable to my argument, then divided the sample there. This trick is called multiple endpoints because there are any number of places where I could draw the dividing line (endpoints), but chose the most favorable one after looking at the data. Football Outsiders used this very same trick, and I'll show exactly how and why.

The graph below plots the change in yards per carry (YPC) against the number of carries in each RB's high-carry year. You can read it to say, a RB who had X carries improved or declined by Y yards per carry the following year. The vertical line is at the 370 carry mark.


Note the cluster of RBs highlighted in the top ellipse with 368 or 369 carries. They improved the following year. Now note the cluster of RBs highlighted in the bottom ellipse. They had 370-373 carries and declined the next year.

If we moved the dividing line leftward to 368 then the very-high-carry group would improve significantly. And if we moved line rightward to 373, then the non-high carry group would decline. Either way, the relationship between high carries and decline in YPC disappears. There is one and only place to draw the dividing line and have the "Curse" appear to hold water.

To be fair to Football Outsiders, they have recently admitted there is nothing magical about 370. A RB isn't just fine at 369 carries, and then on his 370th his legs will fall off. But unfortunately, that's the only interpretation of the data that supports the overuse hypothesis. If you make it 371 or 369, the relationship between carries and decline crumbles. It's circular to say that 370 proves overuse is real, then claim that 370 is only shorthand for the proven effect of overuse. It's pretty clear from the graph above that they assumed overuse was real, then sought an analysis to support their claim.

As Mark Twain (reportedly) once said, "Beware of those who use statistics like a drunkard uses a light post, for support rather than illumination."
 

Temo

Well-Known Member
Messages
3,946
Reaction score
362
They didn't "doctor" anything; they just made a common interpretation error that even a lot of statisicians make (me included).

All that being said, I do believe there is a case that over-using running back would probably limit their lifespan and effectiveness, though there is not "curse of 370"
 

burmafrd

Well-Known Member
Messages
43,820
Reaction score
3,379
over use of a RB is a very valid problem no matter what smoke screens you put up. Putting a precise number on that is stupid because then the bean counters can then prove you wrong.
Bottom line is that how many of those RBs that got a huge number of carries in one year ever really did it again as well?
So the basic premise that you run a RB way too much one year you could cut his career short or use him up so much that he is never the same is still valid correct?
 

Alexander

What's it going to be then, eh?
Messages
62,482
Reaction score
67,294
burmafrd;2832547 said:
over use of a RB is a very valid problem no matter what smoke screens you put up. Putting a precise number on that is stupid because then the bean counters can then prove you wrong.
Bottom line is that how many of those RBs that got a huge number of carries in one year ever really did it again as well?
So the basic premise that you run a RB way too much one year you could cut his career short or use him up so much that he is never the same is still valid correct?
This is hardly even worth the time to analyze anymore.

Most teams carry two or even three backs and try to distribute the load for a reason. That main reason is that you are going to be hard pressed to find an Eric Dickerson or Emmitt Smith who can avoid taking the punishment that goes along with that kind of workload.

You don't need to count beans to see the toll some of these players take. Three years ago, Larry Johnson was considered a stud. Now, he is trudging along.

It is irrelevant anyways because we are one of those teams that plans to spread the carries.
 

Doomsday101

Well-Known Member
Messages
107,762
Reaction score
39,034
I think the fact that in college teams are using multiple backs it is no accident that we see fewer and fewer backs in the pros who can carry the load 20 to 25 times a game.
 

SaltwaterServr

Blank Paper Offends Me
Messages
8,124
Reaction score
1
Temo;2832516 said:
They didn't "doctor" anything; they just made a common interpretation error that even a lot of statisicians make (me included).

All that being said, I do believe there is a case that over-using running back would probably limit their lifespan and effectiveness, though there is not "curse of 370"

I wouldn't necessarily call it a common interpretation error, I'd say it was a deliberate misrepresentation. They even admitted as much in a subsequent article.

You do bring up the point, as others did subsequently, that the overuse can seriously shorten a runningback's theoretical career if they were used more sparingly. I believe that to be a certainty.

Moreover, its why I think Westbrook's on the second season of the beginning of the end of his career. While not huge numbers when you add his carries and catches together for the last few seasons:

2008 - 287 touches
2007 - 368 touches
2006 - 317 touches

but if you add in his playoff touches:

2008 - 344 touches
2007 - 371 touches
2006 - 355 touches

and then figure in the plays he's in the backfield picking up blitzing LB's, DE's, and DT's on stunts you can kind of get the idea that despite he's only going to be 30 at the beginning of the season he's pretty close to used up at this point.
 

NextGenBoys

Well-Known Member
Messages
9,252
Reaction score
1,964
SaltwaterServr;2833624 said:
I wouldn't necessarily call it a common interpretation error, I'd say it was a deliberate misrepresentation. They even admitted as much in a subsequent article.

You do bring up the point, as others did subsequently, that the overuse can seriously shorten a runningback's theoretical career if they were used more sparingly. I believe that to be a certainty.

Moreover, its why I think Westbrook's on the second season of the beginning of the end of his career. While not huge numbers when you add his carries and catches together for the last few seasons:

2008 - 287 touches
2007 - 368 touches
2006 - 317 touches

but if you add in his playoff touches:

2008 - 344 touches
2007 - 371 touches
2006 - 355 touches

and then figure in the plays he's in the backfield picking up blitzing LB's, DE's, and DT's on stunts you can kind of get the idea that despite he's only going to be 30 at the beginning of the season he's pretty close to used up at this point.

As a Cowboys fan I love it.

But as a Westbrook fan, thats too bad.
 

dwmyers

Well-Known Member
Messages
2,373
Reaction score
522
I've seen this stuff before. As debunk material, it's nonsense. They abuse the whole notion of the "rule of 370" by pretending it's some kind of step function. As a consequence, the whole analysis devolves to proving that the injury function is a relatively smooth curve going from 350ish to 370ish. It ignores the base observation, the sheer number of injuries once you begin to exceed that threshhold.

This is the kind of bull you expect from a college freshman or a right wing talking head, not serious statisticians.

And so Saltwater, I'm calling your bluff.. just how many running backs survive those kinds of loads?

The answer is, almost none of them do.

DWM
 

burmafrd

Well-Known Member
Messages
43,820
Reaction score
3,379
dwmyers;2834331 said:
I've seen this stuff before. As debunk material, it's nonsense. They abuse the whole notion of the "rule of 370" by pretending it's some kind of step function. As a consequence, the whole analysis devolves to proving that the injury function is a relatively smooth curve going from 350ish to 370ish. It ignores the base observation, the sheer number of injuries once you begin to exceed that threshhold.

This is the kind of bull you expect from a college freshman or a right or left wing talking head, not serious statisticians.

And so Saltwater, I'm calling your bluff.. just how many running backs survive those kinds of loads?

The answer is, almost none of them do.

DWM

fixed that for you so you don't get dinged for political comment.

Larry Johnson is probably the best current example- though LT might beif he does not do good this year.
 

Tovya

New Member
Messages
777
Reaction score
0
I think the answer is simple, and it goes with any position--the more times you are directly involved in a play, the larger the chance of getting injured.

But then again, that's just life in general. Give me 20 people who drive 12 hours a day, and 20 who do it for 1, and I'd almost guarantee that the wreck risk of those who drive more frequently goes up.

It's not exactly rocket science, but it is common sense.
 
Top