From 1985 to 2008, there have been 1536 teams in the tourney, or 96 teams of each seed. Below, based on all this data, I’ve compiled what each round looks like on average in terms of the number of teams left of each seed. The closer your bracket looks like this framework, the more the historical performance suggests that you’ll have success. Then pick which of the seeds continue to advance. For example, if by a certain round there are expected to be 2.5 #7 seeds left, you might want to rule out one 7-seeded team for sure, include two of them for sure, and then base whether or not you would pick the remaining one on the context.
How would you know what specific teams to pick? Your own thoughts, experts, or simply just random guessing. The last one is especially fun if you want to try and annoy people who follow college basketball religiously. Obviously, with so much going on in the tourney, this approach is nowhere near a guarantee of even having the best bracket among your friends more than 50% of the time. And if you’re trying to get the best bracket in the country, screw any advice and just pick random upsets.
If you do follow this framework, please tell me how it worked out for you. There are some more advanced tips after the data:
_________________________________________________________________
(Round of 64 has four of each seed)
Round of 32:
#1 seeds: 4.00
#2 seeds: 3.83
#3: 3.38
#4: 3.17
#5: 2.71
#6: 2.75 (not a typo, 6 seeds have done better than 5 seeds)
#7: 2.50
#8: 1.83
#9: 2.17 (again, 9 seeds have done better than 8 seeds)
#10: 1.50
#11: 1.25
#12: 1.29
#13: .833
#14: .635
#15: .167
#16: 0
Sweet Sixteen:
#1: 3.50
#2: 2.50
#3: 2.00
#4: 1.71
#5: 1.46
#6: 1.50
#7: .750
#8: .375 (Low because always has to beat the #1 seed to get here)
#9: .125 (Ditto)
#10: .750
#11: .458
#12: .667
#13: .167
#14: .0416
#15: 0
#16: 0
Elite Eight:
#1: 2.88
#2: 1.83
#3: .958
#4: .583
#5: .209
#6: .500
#7: .250
#8: .250
#9: .0416
#10: .292
#11: .167
#12: .0416
#13: 0
#14: 0
#15: 0
#16: 0
Final Four:
#1: 1.75
#2: .875
#3: .500
#4: .375
#5: .167
#6: .125
#7: 0
#8: .125
#9: .125
#10: 0
#11: .0832
#12: 0
#13: 0
#14: 0
#15: 0
#16: 0
Championship Game: Didn’t find data
Champion: Didn’t find data
_________________________________________________________________
More complicated stuff:
- There are a lot of ways to work in the decimal part of the average besides just rounding to the nearest whole number. One interesting way would be to go to http://www.random.org, and use the number generator they have on the right with a min of zero and a max of 99. If the decimal part of the average is greater than this number, round up. If it’s lower, round down. Here’s an example: you don’t know whether to send one or two #4 seeds to the Sweet Sixteen. The average is 1.71. If the random number generator is less than 71, pick two teams. If it’s greater than 71, pick only one team. If it’s 71, run it again. This is a really easy way to generate an accurate estimation of number of upsets in your bracket.
- If you fill in your bracket from the best seeds and work your way down, take into account the seedings of each individual matchup versus the expected seeding. For example, let’s say in the first round you had a 13 seed upset a 4 seed and now that 13 seed is facing a 5 seed in the round of 32. The 5 seed’s chances of winning just got better. Even if this appears to screw up your closeness to the historical averages, it really doesn’t, as 5 seeds historically beat 13 seeds more often than they beat 4 seeds. This isn’t true for every seed combo, though – the difference between 7 and 10 seeds seems to be negligible once you get past the first round.
- If you fill in your bracket from the worst seeds and work your way up, keep in mind that the data just refer to the average number of teams of a seed left in each round, not a given team’s chance of getting to that round if they have already made an upset. For instance, if you have an 9 seed knock off a 1 seed in the Round of 32, their chances of also winning in the Sweet Sixteen aren’t nearly as bad as the .0416 (average number of #9 seeds in the Elite Eight) figure would suggest.
- If you want to figure in actual knowledge of college basketball in a more prominent way, you could forget this framework altogether and assign probabilities to each game. Then use the random number generator in the same way as the first tip – for example, if a team has a 60% chance of winning, have them advance if 60 is greater than the number generated and have the other team advance otherwise. This is a particularly good option if you’re filling out multiple brackets, as each of your brackets would essentially be a separate random simulation of the tournament based on as much of what you know as possible. Since you probably just want at least one to work out, the random factor would insure you against a wrong pick in one draft because you might have had it right in the other.
6 responses so far ↓
1 Daniel Adler // Mar 17, 2009 at 11:29 pm
David, really interesting post. I like your analysis and I’d love to hear what the group has to say. We should definitely discuss this during our meeting tomorrow.
However, I’m not sure I agree with your picking strategy. These games are essentially independent events (unless you feel that a #12 seed winning makes other #12’s less likely to win because they’ll see that the tourney already has its yearly allotment of #12 winners). Overall, you still want to pick the team that has the best chance of winning. Even though only about 3 #1’s make it to the elite 8, that still means that each team has a 75% chance of getting there…do you want to pick against that?
Last year, we had a big discussion and decided that if your goal is to win, there are two very different strategies depending on pool size. In a small pool, you should pick the most likely winners and accept that it won’t be quite perfect. In a large pool, you may want to pick some crazy upsets (decreasing your expected value, but increasing the variance) so you have a chance of doing a phenomenal job and winning. Your strategy seems good for a large pool.
2 David Roher // Mar 18, 2009 at 2:05 pm
I agree completely with the small pool vs. large pool problem. It would be interesting to see how we would draw the line between a small pool and a large one. Obviously, you’d be better off with a spectrum of increasing randomness than an actual line per se. I know absolutely nothing about game theory, but it would be interesting to apply it and see how strategy changes as the number of pool participants go up. Maybe someone with some experience in this could shed some light on it?
I would also argue that my strategy is optimal if you’re filling out multiple brackets, even if both are in small pools. Do you think that’s the case?
3 marvell // May 28, 2009 at 6:43 am
5dqNiI hi! mi site is http://norffg.com
see you!
4 Junior // Jul 12, 2009 at 2:32 pm
I never really broke it down like this before. I kind of figured #1 seeds were that way for a reason but every year they make a big deal about upsets. This breakdown lets me know that the rankings going in do still matter, however, those upsets are quite exciting unless your team is the one upset i guess. Nice Post
5 Motorcycle Lift // Aug 3, 2009 at 11:00 pm
I cant wait till march madness again! This past year i made a trip out of it and plan to do it again!
6 cojipokudy // Aug 21, 2009 at 11:43 am
cojipokudy…
The Rhinelander Daily News …
Leave a Comment