19 Sep 2008
by Bill Connelly
(Ed. Note: A few of the numbers below have been slightly changed due to mistakes in the original editing of this article.)
Some people like being able to take a car apart piece by piece and put it back together, knowing where every part goes and accounting for everything (aside from a couple leftover screws or something). I'm not one of those people. I don't like living up (or down) to stereotypes, but there's way too much geek in me for that. And not to generalize or anything, but if you're reading this site on a daily basis, odds are decent that there's too much geek in you too.
But what about taking a football game apart and putting it back together? Awesome, right? It can be done on the computer, you don't get black gunk on your hands, you can do it while watching TV ... win-win situation. The thought behind my EqPts measure from last week (and therefore the PPP and S&P measures as well) is only one part of scoring points. It's the most important part by all means, but there are other factors involved -- namely, turnovers and special teams (and luck, but we're not measuring that yet -- consider that the leftover screws).
Is it possible to assign a point value to every play -- even "special event" plays like kicks and turnovers -- and piece together the score of a game? Let's find out.
We're going to explore the point values of turnovers, special teams, and penalties, but a couple of numbers should be noted right up front.
So before we go delving into these other categories, it should be noted that we're pretty close already. Do turnovers, special teams and penalties account for those missing six points?
In my last column, I referenced a method FO used to assign a point value to turnovers. I also mentioned that, as soon as I got rolling with my own data entry, I stopped looking at what others in the football stat world had done because I wanted to see what I could come up with on my own. Well, what I came up with turned out to be pretty damn similar. Again, the only difference is that the point values I ascribe to a play are based on the likelihood that a team is going to score on a particular drive; FO's work focused on where the next points were going to come from, on that drive or another one.
In just about any football game, you'll see a reference to turnover margin, or maybe points off of turnovers. But it doesn't take in-depth thinking to realize that not all turnovers are created equal. If a running back fumbles on his opponent's 1-yard line, that's a huge turnover because his team had a high level of expected points, and he threw them away. And if he fumbles on his own 1, that's also huge because it hands his opponents a high level of expected points. And if he fumbles on his opponent's 1, and it's returned for a touchdown, that's doubly huge -- it cost his team quite a few points and handed his opponents a touchdown. But in turnover margin, all three of those fumbles count the same as if some backup quarterback fumbled at midfield on the last play of a 49-7 game.
It seems clear that, as FO has covered in the past, counting the significance of two values -- the team's field position when the turnover happened, and the opponent's resulting field position -- gives you a much better view of a turnover's true costliness. And that's what we're going to try to do.
Let's look at two turnovers:
Using my numbers, Turnover 1 was worth 12.62 points (5.62 points for being at the opponent's 1, 7.00 points for being returned for a touchdown). Turnover 2 was worth 4.26 points (1.92 points lost/prevented, plus 2.34 points given/taken). Is that not a much more accurate read of which turnover truly impacted the result of the game and which did not?
So looking at these point readings can give us a much more accurate feel for teams' "Turnovers = Turnaround" potential in 2008*. Certain teams like Hawaii, Kansas, and Middle Tennessee benefited greatly from turnovers (the Turnover Points Margin solidifies that even further than Turnover Margin) and will almost certainly be due a turnaround in 2008. (Then again, Middle Tennessee just beat Maryland, so what do I really know?)
* Pretty sure Phil Steele has copyrighted "Turnovers = Turnaround" at this point, so I should probably credit him just to be on the safe side. Also, through all of these numbers, realize this: I also count botched punts/field goals as turnovers, so my Turnover Margin figures will likely be different than the official NCAA stats.
One other thing to remember about turnover numbers is that the net gain is 0. Turnovers produce points for one team and against another.
So now it's time to establish point values for special teams. Leaving PATs out of it for now, there are three major special teams categories (and a fourth minor one): Field goals, punts, kickoffs, and (here's the minor one) free kicks. Let's attack them one at a time.
Figuring out what to do about field goals was by far the easiest of these categories. I sorted field goals by distance in five-yard increments (18 to 22 yards, 23 to 27, 28 to 32, etc.), looked at the percentage made in each group, and multiplied the percentage by three (the value of a successful field goal) to determine the expected number of points from each kick. Here's what I found:
|Expected points by field goal distance|
|FG Range (yards)||Average percentage||Expected points|
|18 to 22 yards||91.4%||2.74|
|23 to 27 yards||88.1%||2.64|
|28 to 32 yards||80.3%||2.41|
|33 to 37 yards||69.4%||2.08|
|38 to 42 yards||67.1%||2.01|
|43 to 47 yards||58.1%||1.74|
|48 to 52 yards||45.6%||1.37|
|53 to 57 yards||35.0%||1.05|
So with that, we can treat every field goal like an addition or loss of points. For instance, if you miss a 25-yard field goal, it's a loss of 2.64 points. If you make it, it's worth 0.36 points. That may not seem like a lot, but you have to remember that the team has been adding (and possibly subtracting) points all the way up the field. To get to the opponent's 8-yard line, they've probably earned at least somewhere in the neighborhood of 2-3 EqPts, so the 0.36 points seems a lot more reasonable in that regard.
The field goal idea above was something of a no-brainer for me, but for punts, kickoffs, and free kicks, I had to toss around a few different ideas. Here's what I did (and this applies roughly to all three):
Got it? So the higher the point total, the better it is for the kicking team. The lower, the better for the receiving team. It's like net punting, only more useful and more confusing.
For simplicity's sake, I measured these exactly the same way as I did punts. You kick off from the 30, so that's the first point value in consideration. The second is, naturally, where the ball ends up. I played around with the idea of figuring out the average point value of each kick (for kickoffs that was 1.46) and comparing teams' averages to that (so that about half the teams would be positive, half negative). However, that leads you to the same order of teams, just with different values, so in the end it just became an extra, meaningless step.
This was a minor category. Out of more than 141,000 plays in 2007, there were 54 free kicks. They make a difference ... but not really. Very few teams were involved in more than one free kick in 2007. They're measured exactly the same way as kickoffs, only they're from the 20 instead of the 30, but nobody's "per game" totals are going to be much of anything.
Part of the reason I've done all these "points" measures is for predictive purposes, by all means, but I have another motive: I just love ranking things. And I thought that a "special teams points per game" type of measure would be great rankings fodder. However, there's a problem with that: Teams that score a lot are penalized in "per game" rankings because, well, they also kick off a lot. Per-game numbers will serve the purpose of "putting the car together," but I had to find a different idea for ranking special teams units.
I did this by adding together the "higher is better" numbers, then subtracting the "lower is better" numbers. So we get something like this:
Special Teams Avg. = Kickoff Return Avg. + Punt Av.g + (FG Avg. * 2) - Kickoff Avg. - Punt Return Avg.
(I multiplied field goal average by two so that field goals would carry the same weight as kickoffs and punts.)
So that leads to averages from No. 1 San Diego State (1.69) to No. 120 Duke (-2.95). That's right, San Diego State had the best special teams unit in the country last year. If only every play were based on special teams.
With Special Teams Points Per Game, however, you get a much wider spread. The No. 1 team in the country in per-game terms was Florida International (+8.94), simply because they returned a ton of kickoffs. Next up were San Diego State (+8.49), Syracuse (+7.62), Idaho (+7.32), and Eastern Michigan (+6.98).
Worst? Kansas (-9.77 PPG), Ohio State (-9.42), Hawaii (-7.18), West Virginia (-6.46), and Boise State (-6.27).
This one's easy. We've got two Penalty Points numbers: Offensive Penalty Points and Defensive Penalty Points. Both numbers are based on obvious concepts:
Offensive Penalty Points = EqPts gained from your opponents' defensive penalties – EqPts lost from your own offensive penalties.
Defensive Penalty Points are exactly the opposite. On a per-game basis, penalty margins ranged from Kansas (+4.37 per game), San Jose State (+3.87), and UConn (+3.76) at the high end to Florida International (-6.29), NC State (-5.66), and Idaho (-5.17) at the low end.
So here's the coolest part. I took the car apart, not knowing what would happen when I attempted to reassemble it, and here's what I got:
|Average Points Per Game of Various Events|
|Average Turnover Points Per Game*||2.15|
|Average Penalty Points Per Game*||2.15|
|Average Special Teams Points Per Game*||-0.39|
|Average EqPts Per Game||49.31|
|Total Projected Points Per Game||53.22|
* As a reminder, these are based off of margins. That’s why the numbers are just a bit over or under zero. On average, there are about 2.15 more Offensive Turnover Points per game than Defensive Turnover Points (remember, that number is based off of starting and ending field position); similarly, there are about 2.15 more Offensive Penalty Points per game than Defensive Penalty Points. Meanwhile, Special Teams points trended slightly toward the defensive side of the ledger.
Not bad at all. We can account for 53.22 of 55.34 points per game. Only a couple screws here and there are missing. But how are they distributed? Do individual teams' per-game Projected Points averages resemble their actual points? Yes and no.
Some teams match up unbelievably well between their actual points and projected points. Navy averaged 39.31 points per game in 2007. Their projected total? 39.27. Texas A&M: 27.92 vs 28.00. Washington: 29.23 vs 29.07.
But teams at the extreme ends of the scale saw bigger differences. West Virginia averaged 39.62 points per game but only put up a projected total of 29.70. Kansas' 42.77-point offense only saw 35.86 projected points. And on the low end, Syracuse managed only 16.42 points compared to 21.52 projected points. UNLV's numbers were just as different -- 17.25 points vs. 22.92 projected points.
So I'll wrap this up with a couple of questions:
1) What do you think causes the variance at the ends of the scale?
2) What should be done about it? Is it as simple as applying an exponential multiplier, making the high numbers higher and the low numbers lower? If this question can be answered reasonably and accurately, then the world is our statistical oyster. We can look at the specific points in the game most directly tied to wins and losses. We can come up with a reasonable way to account for the massive difference in talent from team to team in college (something that's obviously not as much of a problem in the NFL). We can look at college football in an entirely new way.
Which is a lot more fun to me than working on a car.
A couple of responses to last week's comments:
"Also, I wonder about the fact (if I'm understanding this right) that yards in your own territory are less valuable than yards gained elsewhere on the field. It would seem to me that the ability to get out of the shadow of your own endzone can be especially valuable."
There's definitely something to this, though I think some of it comes into play with punting and some of the special teams numbers. If you move the ball from your 1 to your 20, then uncork a 45-yard net punt, that will account for some EqPts that basically serve as a "points prevented" figure.
"To make comparisons, you need to adjust for defense. Sort of like the difference between OPS and OPS+, only even more so."
Remember last week's S&P measure? I used the OPS+ as a jumping-off point for my S&P+ idea, which I will discuss next week. No "park factors" involved as in baseball, but it is indeed an attempt to place everybody on an even playing field. Hawaii probably did not, indeed, have the No. 3 offense in the country last year, but they did have the No. 3 offensive stats, which is all we've been able to discuss so far. S&P+ will take a stab at the rankings.
"Do these numbers mean those offenses are 'good at winning college football games' or 'good at exploiting superiority in college football games?'"
A concept we'll look at in a couple weeks (after we've exhausted the '+' concept) is Win Correlations, which simply draws correlations between specific statistical categories and wins/losses, both for college football as a whole and for specific teams. It's a lot of fun -- I based my college football previews (scroll down for conference posts) off it.
12 comments, Last at 29 Dec 2008, 12:36pm by dogstar30