This Riddler problem is about the game of tennis!
Your wish has been granted, and you get to play tennis against Roger Federer in his prime in the Wimbledon final. You have only a 1 percent chance to win each point, but Roger, sporting gentleman that he is, offers to let you name any score and begin the match at that point. (So, if you’ve entertained a fantasy of storming back after being down three match points in the fifth set, now’s the time to live it.) What score can you name that gives you the best chance to win, and what is your chance of winning the title?
A rough (and approximate) solution:
[Show Solution]
Here is the rundown of how tennis is scored:
- A match is won by the first player to win three sets. Therefore up to five sets will be played. The first four sets are ordinary sets while the fifth set (if necessary) is an advantage set.
- An ordinary set is won by the first player to (i) win six ordinary games and (ii) have a two game lead. If the score ever gets to 6-6, then a special tiebreak game is played, and the winner of that game wins the set.
- An advantage set is like an ordinary set, but with no tiebreak. There is no limit on the number of games needed.
- An ordinary game is won by the first player to (i) win four points and (ii) have a two point lead. No limit on the number of points needed.
- A tiebreak game is won by the first player to (i) win seven points and (ii) have a two point lead. No limit on the number of points needed.
It may seem at first glance that the most advantageous score is to be up 2 sets to 0, to be winning the third set 5 games to 0, and to be winning the sixth game of that set 40-love (3 points to 0). If we’re going to beat Roger, it will most likely happen during that 40-love game. If Roger gets past this, he’ll almost certainly win the rest of the match. There are many ways Roger could win the 40-love game. He could win 5 point in a row (probability $(0.99)^5 \approx 0.951$). Or he could win three in a row, lose one of the next two games, then win two in a row (probability $2(0.01)(0.99)^6 \approx 0.019$), and so on. All other possibilities are increasingly unlikely so let’s stop here and call it roughly $0.970$. So we have roughly a 3% chance of beating Roger if we start with this score.
This is not the best starting score, however! As mentioned above, tiebreak games are played to seven points instead of four! So the most advantageous score is actually to be up 2 sets to 0, to have the third set tied 6 games to 6, and to be winning the tiebreak game 6-0. Then, using an argument similar to the one above, we can approximate Roger’s chances of winning as $(0.99)^8( 1 + 2(0.01)(0.99) ) \approx 0.941$.
We have roughly a 5.9% chance of beating Roger.
A more detailed (and exact) solution:
[Show Solution]
The fair case
We’ll start by calculating what happens in the fair case (where Federer doesn’t give you a head start). Let’s step aside from tennis for a second and imagine a generic game where a sequence of rounds is played. Each round, a point is awarded either to me with probability $q$, or to my opponent otherwise. We will play until one of us gets $m$ points and wins by two. In the event that we tie at $m-1$ points each, then we flip a weighted coin to break the tie. In such a coin flip, I win with probability $r$. Let $f_m(q,r)$ be the probability that I win this game.
The probability that I win with a score of $m$ to $k$ is given by ${m+k-1 \choose k} q^m (1-q)^k$, since I must win the final game and my opponent will win $k$ of the remaining $m+k-1$ games. The tie occurs with probability ${2m-2 \choose m-1} q^{m-1}(1-q)^{m-1}$ and in this case I win with probability $r$. So we have:
\[
f_m(q,r) = \sum_{k=0}^{m-2} {m+k-1 \choose k} q^m (1-q)^k + {2m-2 \choose m-1} q^{m-1}(1-q)^{m-1}r
\]
The first case of interest is when the game is tied (deuce), and we will play until one player wins by two. Let’s call $x_d,x_f,x_m$ the probability of winning given that the current score is “deuce”, “advantage Federer”, or “advantage me”, respectively. Also, suppose that the probability that I win one point is $p$. Then we can write the following sets of equations:
\begin{align}
x_d &= p x_m + (1-p) x_f \\
x_f &= p x_d \\
x_m &= p + (1-p) x_d
\end{align}
Therefore, the probability that I win a game if the score is deuce is:
\[
x_d(p) = \frac{p^2}{1-2p+2p^2}
\]
We can now build up the entire match one piece at a time.
- Each ordinary game is won at 4 points, win by 2, and if we tie at 3-3 then we are at deuce. So my probability of winning a game is:
\[
g = f_4(p,x_d(p))
\] - Each tiebreak game is won at 7 points, win by 2, and if we tie at 6-6 then we are at deuce. So my probability of winning a tiebreak game is:
\[
\bar g = f_7(p,x_d(p))
\] - Each ordinary set is won at 6 games, win by 2, and if we tie at 5-5, we play two more games. I can win by winning both extra games or by winning one of the two games and then winning a tiebreak game. So if we tie at 5-5, my chance of winning is $g^2 + 2g(1-g)\bar g$. So my overall chance of winning an ordinary set is:
\[
s = f_6(g, g^2 + 2g(1-g)\bar g)
\] - Each advantage set is won at 6 games, win by 2, and if we tie 6-6 then we are at deuce. So my probability of winning an advantage set is:
\[
\bar s = f_6( g, x_d(g) )
\] - The match is won at 3 sets, win by 2, and if we tie at 2-2 then we play an advantage set. So my probability of winning a match is:
\[
M = f_3( s, \bar s)
\]While it’s possible to actually calculate these quantities, the result is pretty messy. For those interested, $M(p)$ is a rational function whose numerator has degree 465 and denominator has degree 136. Yuck! So I won’t write it out… but here is what the functions look like as plots:
I didn’t include advantage sets on the plot because they have virtually the same distribution as ordinary sets. As expected, the picture is perfectly symmetric; if we have a 50% chance of winning every point, then we have a 50% chance of winning every game, every set, and the entire match. The shape also makes sense; if we are worse than our opponent, then winning a game or a set or a match becomes progressively less likely.
Here are some exact numbers for those who are interested:
P(win point) P(win game) P(win set) P(win match) 49% 47.5% 42.9% 36.8% 45% 37.7% 18.5% 4.61% 40% 26.4% 3.65% 0.04% 1% 1.50×10-7 2.35×10-39 1.30×10-115 So if your chance of winning one point is 1%, your chance of beating Federer is roughly the same as your odds of winning the Mega-Million jackpot 14 times in a row. Not good.
The unfair case
As discussed before, we’ll want to start with a 2-0 set lead in the match. The third set will be tied 6-6 and we’ll be up 6-0 in the tiebreak game.
To find our chance of winning in this case, we will instead calculate our chance of losing, because the algebra is simpler. We need to calculate:\begin{align}
&\mathbb{P}(\text{lose match})\\
&= \mathbb{P}(\text{lose tiebreak up 6-0}) \times \mathbb{P}(\text{lose match up 2-1}) \\
&= \mathbb{P}(\text{lose 6 points}) \times \mathbb{P}(\text{lose deuce}) \times \mathbb{P}(\text{lose 1 set}) \times \mathbb{P}(\text{lose 1 adv. set}) \\
&= (1-p)^6(1-x_d(p))(1-s)(1-\bar s)
\end{align}Once again, we can evaluate this quantity algebraically, and it turns out to be a mess… it’s a rational function of $p$ whose numerator has degree 182 and whose denominator has degree 60. But there is a trick: we can approximate $s\approx \bar s \approx 0$, i.e. we just give up if we don’t win that first tiebreak, and we get a much simpler formula:
$\displaystyle
\mathbb{P}(\text{win match}) \approx 1\, -\, \frac{(1-p)^8}{1-2p+2p^2}
$To verify that this approximation is a good one, I plotted both the true function and the approximation on the same axes:
Here is a plot showing the difference between the curves (on a log scale):
So if we have $p=0.01$ as specified in the problem, the approximation will be correct to 36 decimal digits! The final numerical result is:
$\displaystyle
\mathbb{P}(\text{win match}) = 5.86159\%
$So our original approximation of 5.9% wasn’t far off.
Great explanation Laurent! I had come to the conclusion that being up 2 sets and 6 points on the tie break is the most advantageous position to be in as well (but didn’t do any of the messy math). One of the things I was wondering about was whether or not being up 2-0 or 3-1 would be a preferable situation. I know the odds of winning unless if you don’t win the pre-arranged 6 match-point tie-break are infinitesimally small, but is there a difference between to two cases? I would imagine that a smaller number of remaining points favors the weaker player but I also might be completely wrong in that regard. Would love to hear if you come up with anything!
I don’t think you can be up 3 sets to 1, because in that case the match would already be over. The match ends once a player wins three sets.
Laurent
Sorry it’s been a while since I did my stats course so can you explain why my answer of using the binomial distribution (6C1 0.01^1 (1-0.01)^5 ) is wrong?
Thanks
The quantity you wrote down is the probability of winning one point and losing the other five. The reason this isn’t quite right is that you don’t always play six points. It’s match point, so if you win a point at all the game ends and you don’t play any more points.
The probability you’ll win the first point is just $0.01$. The probability you’ll win the second point is $(1-0.01)(0.01)$. The probability you’ll win the third point is $(1-0.01)^2(0.01)$. And so on. If you sum these up (geometric series), you get $1-(1-0.01)^6$, which makes sense because $(1-0.01)^6$ is the probability that you lose every point. One minus this is the probability that you don’t.
That makes sense.
Thanks for the explanation