counting - Book Proofs

Tower of goats

This week’s Riddler classic is a counting problem. Can the goats fit in the tower?

A tower has 10 floors, each of which can accommodate a single goat. Ten goats approach the tower, and each goat has its own (random) preference of floor. Multiple goats can prefer the same floor. One by one, each goat walks up the tower to its preferred room. If the floor is empty, the goat will make itself at home. But if the floor is already occupied by another goat, then it will keep going up until it finds the next empty floor, which it will occupy. But if it does not find any empty floors, the goat will be stuck on the roof of the tower. What is the probability that all 10 goats will have their own floor, meaning no goat is left stranded on the roof of the tower?

My solution:
[Show Solution]

Suppose there are $n$ goats and the building has $n$ floors. Let $a_i$ be the floor preference of goat $i$, with $i=1,\dots,n$. We will say a vector of floor preferences $v=(a_1,a_2,\dots,a_n)$ is “crowded” if it leads to exactly one goat on each floor.

Let $f(n)$ be the number of crowded floor preferences. Since there are $n^n$ total possible preference vectors (each of the $n$ goats can prefer $n$ different floors), the probability that each goat finds a floor is $f(n)/n^n.$ Therefore, solving the problem amounts to finding $f(n)$.

The following elegant counting argument is due to Pollak (1974). We start by considering a modified version of the problem:

Add one extra floor (the roof). The goats are allowed to pick this $(n+1)^\text{st}$ floor as their preferred floor.
Just like the other floors, the roof can accommodate at most one goat.
If a goat ends up on the roof and there is no space for it, the goat “wraps around”; it takes the elevator back down to the first floor and continues its search for an open floor.

In our modified version of the problem, each goat will eventually occupy a floor (possibly the roof), and there will be exactly one empty floor. If the empty floor is the roof, then no goat had the roof as its preferred floor, no goat ever visited the roof, and no goat ever had to take the elevator back down. Therefore, the vector of floor preferences was crowded according to our original definition. Conversely, if one of the other floors ends up empty, then a goat ended up on the roof, which means the original vector of floor preferences was either invalid (one of the goats picked the roof), or the preferences were valid but a goat nonetheless overflowed onto the roof.

So not only is $f(n)$ equal to the number of crowded preference vectors in the original problem, it is also equal to the number of preference vectors in the modified problem that lead to the roof being empty.

The final piece to this argument is to realize that there must be an equal number of preference vectors that lead to the $k^\text{th}$ floor being empty, for $k=1,\dots,n+1$. This follows because of the symmetry in the modified version of the problem. Specifically, if a particular preference vector $(a_1,\dots,a_n)$ leads to floor $k$ being empty, then $(a_1+j,\dots,a_n+j)$ leads to floor $k+j$ being empty (where we wrap all numbers around so they are in the range between $1$ and $n+1$).

Since there are $(n+1)^n$ total possible preference vectors for the modified problem (each of the $n$ goats can prefer $n+1$ different floors), and there are $n+1$ possible empty floors, we obtain
\[
f(n) = \frac{(n+1)^n}{n+1} = (n+1)^{n-1}
\]Converting this number into a probability as explained at the beginning of the solution, we find that the probability that $n$ goats have a crowded set of preferences is

$\displaystyle
p(n) = \frac{(n+1)^{n-1}}{n^n} = \frac{1}{n+1}\left(1+\frac{1}{n}\right)^n
$

For the case $n=10$, we can calcluate:
\[
p(10) = \frac{11^9}{10^{10}} = \frac{2{,}357{,}947{,}691}{10{,}000{,}000{,}000} \approx 23.58\%
\]

Since $\lim_{n\to\infty} \left(1+\frac{1}{n}\right)^n = e$, it follows that in the asymptotic limit of large $n$, we have $p(n) \sim \frac{e}{n+1}$. Here is a plot comparing the true probability to this asymptotic approximation.

About this problem’s history

This sort of counting problem, where you start with a preference and move to the next available open slot, is also known as a parking problem, because its original formulation was about cars trying to park on a one-way street and taking the next available open spot. There are many other equivalent counting problems, which you can find more about in the entry A000272 in the Online Encyclopedia of Integer Sequences. For a wonderful explanation of the parking problem and other related counting problems, take a look at Richard Stanley’s slides on the topic. The solution in my post above is based on the argument in Richard’s slides, which is due to Pollak (1974).

Squid game

This week’s Riddler Classic is Squid Game-themed!

There are 16 competitors who must cross a bridge made up of 18 pairs of separated glass squares. Here is what the bridge looks like from above:

To cross the bridge, each competitor jumps from one pair of squares to the next. However, they must choose one of the two squares in a pair to land on. Within each pair, one square is made of tempered glass, while the other is made of normal glass. If you jump onto tempered glass, all is well, and you can continue on to the next pair of squares. But if you jump onto normal glass, it will break, and you will be eliminated from the competition.

The competitors have no knowledge of which square within each pair is made of tempered glass. The only way to figure it out is to take a leap of faith and jump onto a square. Once a pair is revealed — either when someone lands on a tempered square or a normal square — all remaining competitors take notice and will choose the tempered glass when they arrive at that pair.

On average, how many of the 16 competitors will make it across the bridge?

Here is my solution.
[Show Solution]

Let’s consider a more general version of the game. Let $f(n,m)$ to be the expected number of competitors that make it across the bridge assuming there are $n$ total competitors and the bridge has $m$ unknown tiles remaining. If there are no competitors, then clearly none can make it across. Also, if there are no unknown tiles remaining, then all competitors make it through. This leads to the boundary conditions
\begin{align}
f(0,m) &= 0\quad\text{for }m=0,1,2,\dots \\
f(n,0) &= n\quad\text{for }n=0,1,2,\dots
\end{align}Now consider the general case with $n$ competitors and $m$ bridge tiles. Each time a competitor takes a turn, they will cross some number of tiles before they are eliminated. Let’s look at the possible cases for the first competitor.

With probability $1/2$, they are eliminated on the first tile. So the $n-1$ remaining competitors only have to contend with $m-1$ unknown tiles.
With probability $1/4$, they are eliminated on the second tile. So the $n-1$ remaining competitors only have to contend with $m-2$ unknown tiles.
Continuing in this fashion, with probability $1/2^m$, they are eliminated on the last (the $m^\text{th}$) tile, and the $n-1$ remaining competitors have $0$ unknown tiles to deal with.
Finally, with the remaining probability of $1/2^m$, the competitor guesses all tiles correctly, which means that all $n$ competitors will get across safely.

We can express these statements concisely in the following recursion.
\[
f(n,m) = \sum_{k=1}^m \frac{1}{2^k} f(n-1,m-k) + \frac{n}{2^m}
\]One way to simplify this expression is to multiply both sides by $2^m$ and define $g_n(m) := 2^m f(n,m)$. Then, the recursion becomes
\begin{align}
g_0(m) &= 0 \\
g_n(m) &= n + \sum_{k=0}^{m-1} g_{n-1}(k)\quad\text{for }n=1,2,\dots
\end{align}Applying this recursion for the first several steps, we obtain
\begin{align}
g_1(m) &= 1\\
g_2(m) &= 2+m\\
g_3(m) &= \tfrac{1}{2}(6+3m+m^2)\\
g_4(m) &= \tfrac{1}{6}(24+14m+3m^2+m^3)\\
g_5(m) &= \tfrac{1}{24}(120+70m+23m^2+2m^3+m^4)\\
g_6(m) &= \tfrac{1}{120}(720+444m+120m^2+35m^3+m^5)\\
g_7(m) &= \tfrac{1}{720}(5040+3108m+1024m^2+135m^3+55m^4-3m^5+m^6)\\
g_8(m) &= \dots
\end{align}From there, we can recover the expected number of winners via $f(n,m) = g_n(m) /2^m$. Although computing each subsequent function is straightforward, I could not find a closed-form expression for $g_n(m)$. Each is a polynomial of degree $n-1$, but beyond some obvious observation such as the constant term being $n$ and the common denominator being $(n-1)!$, I couldn’t find a general formula.

For the specific instance ($n=16$ and $m=18$) described in the problem statement, we can obtain the exact expected number of winners, and it is

$\displaystyle
f(16,18) = \frac{458757}{65536} = 7.0000762939453125 \text{ (exact)}
$

which is slightly larger than $7$.

Approximate (asymptotic) solution

We can approximate $f(n,m)$ as follows. Each tile will eliminate on average $\frac{1}{2}$ of a contestant. So if we start with $n$ contestants, roughly $n-\frac{1}{2}m$ contestants will cross the bridge. This leads to the approximation

$\displaystyle
f(n,m) \approx n-\tfrac{1}{2}m
$

This approximation doesn’t work when the number of contestants is small. For example, if $n\lt \frac{1}{2}m$, it would lead to a negative number of contestants crossing the bridge, which is of course impossible. But the approximation turns out to be pretty good when $n$ is larger. Applying the approximation to the specific instance in the problem statement, we find $f(16,18) \approx 16-\tfrac{1}{2}\cdot 18 = 7$, and this is very close to the true solution!

Here are some plots that confirm the formula; I plotted $f(n,m)$ for fixed values of $n$ and $m$.

And here is a much better solution!
[Show Solution]

This solution comes courtesy of comments by Guy D. Moore and MarkS.

As in the previous solution, suppose we have $n$ contestants and $m$ bridge tiles. Suppose $b$ tiles are broken during the course of the game. This occurs with probability $\frac{1}{2^m}\binom{m}{b}$, since each tile has a probability of $\frac{1}{2}$ of being guessed incorrectly. When $b$ tiles are broken, $n-b$ contestants cross the bridge. Finally, we can break at most $\min(n,m)$ tiles, since there are only $m$ tiles, and each of the $n$ players can break at most one tile. Therefore, the expected number of competitors to make it across the bridge is:

$\displaystyle
f(n,m) = \frac{1}{2^m}\sum_{b=0}^{\min(n,m)} \binom{m}{b}(n-b)
$

If the special case where $n\geq m$, the sum will go up to $m$, and we can evaluate it exactly, by writing:
\begin{align}
(1+x)^m &= \sum_{b=0}^m \binom{m}{b} x^{m-b} \\
x^{n-m}(1+x)^m &= \sum_{b=0}^m \binom{m}{b} x^{n-b} \\
(n-m)x^{n-m-1}(1+x)^m + m x^{n-m}(1+x)^{m-1} &= \sum_{b=0}^m \binom{m}{b} (n-b)x^{n-b-1}
\end{align}where in the last step, we differentiated both sides with respect to $x$. Now, setting $x=1$ on both sides and dividing by $2^m$, we obtain:
\[
\frac{1}{2^m}\sum_{b=0}^m \binom{m}{b} (n-b) = n-\frac{1}{2}m
\]So the “asymptotic” solution I found in my first solution isn’t just asymptotic; it’s exact when $n\geq m$.

When $n\lt m$, there is no closed-form expression for the sum. However, we can be smart about how we evaluate it. Since we know the sum of all $m$ terms, we only have to evaluate $\min(n,m-n)$ terms. This leads us to the complete solution:

$\displaystyle
f(n,m) = \begin{cases}
n-\frac{1}{2}m & \text{if }n \geq m \\
\frac{1}{2^m}\sum_{b=0}^n \binom{m}{b} (n-b) & n\lt m\\
n-\frac{1}{2}m-\frac{1}{2^m}\sum_{b=n+1}^m \binom{m}{b} (n-b) & n \lt m\text{ (alt.)}
\end{cases}
$

Interesting side-note

Guy D. Moore also pointed out that there is a simpler recurrence relation for $f(n,m)$, given by:
\[
f(n,m) = \frac{f(n,m-1) + f(n-1,m-1)}{2}
\]Using this recurrence with the generating function
\[
G(x,y) = \sum_{n=0}^\infty \sum_{m=0}^\infty f(n,m) x^n y^m,
\]it is possible to show that:
\[
G(x,y) = \frac{x}{(1-x)^2}\cdot \frac{2}{2-y(x+1)}
\]Therefore, $f(n,m)$ is the coefficient corresponding to the $x^n y^m$ term in the series expansion of $G(x,y)$ above! Here is an example of this working for $n=16$ and $m=18$ (as in the original problem) using WolframAlpha.

Flawless war

his week’s Riddler Classic has to do with the card game “War”. Here is the problem, paraphrased:

War is a two-player game in which a standard deck of cards is first shuffled and then divided into two piles with 26 cards each; one pile for each player. In every turn of the game, both players flip over and reveal the top card of their deck. The player whose card has a higher rank wins the turn and places both cards on the bottom of their pile. Assuming a deck is randomly shuffled before every game, how many games of War would you expect to play until you had a game that lasted just 26 turns (with no ties; a flawless victory)?

Here is my solution:
[Show Solution]

Suppose our cards have values $\{1,2,\dots,n\}$ and the deck contains $m_i$ cards with value $i$. The total number of cards in the deck is equal to $m = m_1 +\dots+m_n$, and we assume $m$ is even so that the deck can be evenly split in half. In a standard deck of playing cards, we have $n=13$ and $m_1=m_2=\dots=m_n = 4$.

Rather than thinking about “how many games we would expect to play before I had a flawless game”, let’s instead work out the probability that a game of War will be flawless for me (I wins in $m/2$ turns with no ties). This is a counting problem: how many ways are there of arranging the cards in the deck so that I win flawlessly?

To this effect, define $f(m_1,\dots,m_n)$ to be the number of ways I can win flawlessly if the cards remaining have distribution $(m_1,\dots,m_n)$. We will develop a recursive formulation of $f$.

If I pick “$i$” in the first round, I will win the round with no tie if my opponent picks “$j$” with $j \lt i$. This can happen in $m_i m_j$ ways, and then I can win the remaining rounds flawlessly in $f(\dots)$ ways, where the arguments $m_i$ and $m_j$ are each decremented by $1$. Therefore, we can write:
\[
f(m_1,\dots,m_n) = \sum_{1 \leq j \lt i \leq n} m_i m_j f(m_1,\dots,m_j-1,\dots,m_i-1,\dots,m_n)
\]We also have the terminal condition $f(0,\dots,0)=1$, for if we ever achieve this condition, there are no cards left and we have won!

We can make two critical observations that will greatly simplify computation of $f$:

We can prune out any zero entries. For example, $f(0,2,0,4,0,0) = f(2,4)$. This follows because when a particular card number is not present in the deck, we may relabel the numbers on the cards and simply skip over the missing number.
We can rearrange arguments. For example, $f(6,4,1,7) = f(1,4,6,7)$. This is due to the fact that our recursive equation is symmetric in the arguments (all pairs of indices are included in the sum and contribute similarly to the count) and our base case $f(0,\dots,0)$ is symmetric as well.

Equipped with these tools, we can write very efficient code to compute the solution for the case of a standard deck of cards. Here is an efficient recursive implementation in Python that computes the solution for a standard deck of cards.

from itertools import combinations

def memoize(f):
    memo = {}
    def helper(x):
        # prune zeros and sort!
        y = tuple( sorted([z for z in x if z > 0]) )
        if y not in memo:
            memo[y] = f(y)
        return memo[y]
    return helper

@memoize
def winning_hands(counts):
    N = len(counts)    
    if N == 0:  # base case (no cards left in the deck)    
        return 1
    
    out = 0
    for i1,i2 in combinations(range(N),2):
        tmp = list(counts)
        tmp[i1] -= 1
        tmp[i2] -= 1
        out += counts[i1] * counts[i2] * winning_hands(tmp)
    return out

winning_hands( 13 * [4] )

The code took about 100ms to compute the answer:
\[
252656585660288881535364185959261220490470307542335488000000
\]

We can divide this by the total number of hands (which is $m!$) to obtain the probability of winning flawlessly.

from math import factorial
from fractions import Fraction

Fraction( winning_hands(13 * [4]), factorial(52) )

And the result is:
\begin{align}
p &= \mathrm{Prob}(\text{flawless win}) \\
&= \frac{93686147409122073121}{29908397871631390876014250000} \\
&\approx 3.132436\times 10^{-9}
\end{align}

This is a very small number, but the more we play, the likelier we are to win. To quantify this, suppose we play $N$ games. The probability that we win flawlessly at least once is $1-(1-p)^N$. We can plot this as a function of $N$ to see how it grows:

So to have a $50\%$ chance of winning flawlessly at least once, I would have to play around 220 million times. To have a $95\%$ chance of winning flawlessly at least once, I would have to play around 1 billion times. Needless to say, that’s a lot of war.

Cutting a ruler into pieces

This week’s Riddler Classic is a paradoxical question about cutting a ruler into smaller pieces.

Recently, there was an issue with the production of foot-long rulers. It seems that each ruler was accidentally sliced at three random points along the ruler, resulting in four pieces. Looking on the bright side, that means there are now four times as many rulers — they just happen to have different lengths. On average, how long are the pieces that contain the 6-inch mark?

With four cuts, each piece will be on average 3 inches long, but that can’t be the answer, can it?

Here is my solution:
[Show Solution]

We will consider the following more general version of the problem.

Suppose a ruler of length $L$ is marked at a fraction “$a$” from one end of the ruler. Now suppose $N-1$ cuts are chosen uniformly at random along the length of the ruler, which splits the ruler into $N$ smaller pieces. What is the expected length of the piece that contains the mark?

In the original problem, $L=12\text{ inches}$, $a=\tfrac{1}{2}$, and $N=4$.

We will start by considering a simpler problem. Suppose we have $k$ numbers chosen uniformly at random and independently on the interval $[0,b]$. What is the expected value of $\min_{1\leq i \leq k} x_i$? We can compute this using the fact that:
\begin{align}
\mathbb{E}\left( \min_{1\leq i \leq k} x_i \right) &= \int_0^b \mathbf{Prob}( \min(x_i) \geq t )\,\mathrm{d}t \\
&= \int_0^b \mathbf{Prob}( x_1 \geq t, \dots, x_k \geq t )\,\mathrm{d}t \\
&= \int_0^b \mathbf{Prob}( x_1 \geq t) \cdots \mathbf{Prob}( x_k \geq t )\,\mathrm{d}t \\
&= \int_0^b \mathbf{Prob}( x_1 \geq t)^k\,\mathrm{d}t \\
&= \int_0^b \left( \frac{b-t}{b}\right)^k \mathrm{d}t \\
&= \frac{b}{k+1}
\end{align}The first equality follows from the fact that expectation can be written as the integral of the complementary cumulative distribution function (wiki link). Then, we use the fact that the $x_i$ are mutually independent and identically distributed.

Back to our original problem, we can split it into cases depending on how many cuts end up to the left vs how many end up to the right of our mark. Suppose we have $k$ cuts to the left of $a$, and the remaining $N-k-1$ cuts to the right. Then the expected length of the piece containing $a$ is the sum of the expected lengths of the “left part” and “right part”. Each of these pieces can be computed based on the preliminary result above, and we obtain:
\[
L\left(\frac{a}{k+1}+\frac{1-a}{N-k}\right)
\]Note that this formula gives the correct answer when $k=0$; it returns a length of $a$ for the left half (the whole interval). Next, we must compute the probability of this actually occurring; that exactly $k$ cuts are on the left and $N-k-1$ cuts are on the right. Since the probability of these events happening are $a$ and $1-a$, respectively, and they are mutually independent, we have a binomial distribution. The probability that $k$ cuts are to the left of $a$ is therefore:
\[
\binom{N-1}{k}a^k (1-a)^{N-k}
\]Combining these two facts, the expectation we seek to evaluate can be written as the sum:
\[
\sum_{k=0}^{N-1} L\left(\frac{a}{k+1}+\frac{1-a}{N-k}\right) \binom{N-1}{k}a^k (1-a)^{N-k-1}
\]We can split this into two separate sums. For the first one,
\begin{align}
&\sum_{k=0}^{N-1} \frac{L a}{k+1}\binom{N-1}{k}a^k (1-a)^{N-k-1}\\
&= \sum_{k=0}^{N-1} \frac{L}{N}\binom{N}{k+1}a^{k+1} (1-a)^{N-k-1} \\
&= \sum_{k=1}^{N} \frac{L}{N}\binom{N}{k}a^{k} (1-a)^{N-k} \\
&= \frac{L}{N}\left( 1-(1-a)^N \right)
\end{align}where we used the binomial theorem in the final step to evaluate the sum. Using a similar argument for the other half of the sum, we obtain $\frac{L}{N}\left( 1-a^N\right)$. Combining both parts, we find our final formula for the expected length of the piece containing the mark:

$\displaystyle
\frac{L}{N}\Bigl( 2-a^N-(1-a)^N \Bigr)
$

In particular, for the original problem statement, $L=12\text{ inches}$, $a=\tfrac{1}{2}$, and $N=4$. So the final answer is $\frac{45}{8}= 5.625\text{ inches}$. This is substantially longer than the average length of each piece, which is 3 inches.

Several interpretations of this fact were brought to my attention, so I will relay a couple here.
Commenter Guy D. Moore noted that this is an example of selection bias. My colleague Kangwook Lee also pointed out that this phenomenon is precisely the inspection paradox from probability theory.

Here is an intuitive explanation for why the true answer is so much larger than 3. The longer pieces of ruler are more likely to contain the 6-inch mark. In fact, any piece longer than 6 inches is guaranteed to contain the 6-inch mark! Although such pieces are rare, we are conditioning on the fact that our piece contains the 6-inch mark, so we are likelier to pick out these longer pieces and so the average piece length will be longer.

Limiting cases

As a sanity check, let’s see what happens when we consider the limiting behavior of our formula.

If $N=1$ (just one piece), the formula simplifies to $L$, which makes sense. In this case, no cuts are made, so all pieces contain the mark, and all pieces are of length $L$.
If $a=0$ or $a=1$ (the mark is at either end of the ruler), the formula simplifies to $\frac{L}{N}$, which is the average piece length. This makes sense because the first piece always contains the mark, and there is no reason the first piece should be any longer or shorter than the average piece.
If $N$ gets very large and $0\lt a\lt 1$, the formula tends to $\frac{2L}{N}$. So pieces containing the mark are on average twice as long as the average piece.

Here is a figure showing how much longer the marked piece of ruler is compared to the average length $\frac{L}{N}$. We can visually confirm the three limiting cases: when $N=1$, we just get the average length, if $a=0$ or $a=1$ we also get the average length, and as $N\to\infty$, we do indeed tend to $2$. The solution to the original problem ($a=\tfrac{1}{2}$ and $N=4$) produces a ratio of $1.875$, which when multiplied by the average length $\frac{L}{N} = \frac{12}{4} = 3$ produces the answer we reported above, $5.625$.

Connect the dots

This week’s Riddler Classic is a problem about connecting dots to create as many non-intersecting polygons as possible. Here is the problem:

Polly Gawn loves to play “connect the dots.” Today, she’s playing a particularly challenging version of the game, which has six unlabeled dots on the page. She would like to connect them so that they form the vertices of a hexagon. To her surprise, she finds that there are many different hexagons she can draw, each with the same six vertices.

What is the greatest possible number of unique hexagons Polly can draw using six points?

(Hint: With four points, that answer is three. That is, Polly can draw up to three quadrilaterals, as long as one of the points lies inside the triangle formed by the other three. Otherwise, Polly would only be able to draw one quadrilateral.)

Extra Credit: What is the greatest possible number of unique heptagons Polly can draw using seven points?

Here is my solution:
[Show Solution]

It turns out this is a well-studied and notoriously difficult problem in combinatorial geometry. While it is tempting to ask the same question for 8, 9, or even $n$ points, we’ll see that the problem gets very difficult very quickly, and the answer for general $n$ is not actually known!

First, we’ll get some terminology out of the way:

A complete graph is a graph with every possible edge drawn in. The complete graph with $n$ nodes is denoted $K_n$.
A Hamiltonian cycle is a path through the graph that finishes where it started and visits every node exactly once. For the graph $K_n$, there are $\frac{1}{2}(n-1)!$ possible Hamiltonian cycles since we can fix the starting node and then there are $n-1$ nodes left to order. We divide by two because each cycle is counted twice (can be traversed forwards or backwards).
A realization of a graph is a particular way of arranging the nodes. Realizations are important because we care about whether edges cross or not; sometimes two different arrangements of the same graph will have different numbers of edge crossings.
The rectilinear crossings of a graph realization $G$ is the number of times edges cross when we connect the nodes with straight lines. We’ll denote this number $\mathrm{rcr}(G)$.
The crossing-free Hamiltonian cycles of a graph realization $G$ is the number of different Hamiltonian cycles of $G$ that do not contain any edges that cross each other. We’ll denote this number $\mathrm{cfhc}(G)$. These are also called “spanning cycles” or “simple polygonalizations” depending on who you ask.

For a simple example, let’s start with $K_4$, the complete graph on 4 nodes. Although there are infinitely many ways to realize this graph (since we can arrange the 4 points in infinitely many ways), rearrangements of the points that preserve how the edges intersect one another are all equivalent for our purpose (this is an example of an equivalence class). So we only need to consider one representative from each class. These representative realizations are called order types. It turns out $K_4$ has two order types:

The first order type has no crossings ($\mathrm{rcr}(G)=0$) and three possible crossing-free Hamiltonian cycles. Here are the cycles:

The second order type has one crossing ($\mathrm{rcr}(G)=1$) and only one possible crossing-free Hamiltonian cycle:

The bad news…

Unfortunately, things get difficult from this point on as we add more nodes:

The number of order types for $K_n$ grows rapidly with $n$. For $n\ge 3$, they are: $\{1, 2, 3, 16, 135, 3315, 158817, 14309547, 2334512907,\dots\}$. This sequence is in OEIS. There is no known general formula.
The minimal rectilinear crossing number for $K_n$ over all possible realizations for $n\ge 3$ is: $\{0, 0, 1, 3, 9, 19, 36, 62, 102, 153,\dots\}$. This sequence is also in OEIS. No known formula known for this one either.
Finally, the minimal number of crossing-free Hamiltonian cycles for $K_n$ for $n \ge 3$ is: $\{1, 3, 8, 29, 92, 339, 1282, 4994,\dots\}$. And this is also in OEIS. You guessed it; no known formula.

These problems are related but different. For example, we might expect realizations with smaller crossing numbers to contain more crossing-free Hamiltonian cycles, since it is easier to avoid intersections when there are fewer of them. But this is not always the case. We might also expect realizations with more symmetry to contain more cycles; also not true in general. What I’m trying to get at is that there are no (known) tricks we can use to reduce this problem to a simpler one. Unfortunately, it appears the only way to solve such problems is to enumerate all order types (even this is difficult!), then enumerate all possible cycles, keeping only the ones that are crossing-free.

Many other variants are just as difficult: computing the crossing number (with curved edges allowed), the number of triangulations, the minimum number of convex polygons needed for a decomposition of the convex hull, and more. These sorts of problems belong to a branch of mathematics called combinatorial geometry. A great deal of research on the topic of order types, crossing numbers, and more, has been conducted by Prof. Oswin Aichholzer and colleagues. If you’re interested in learning more, I recommend checking out his webpage here, which contains an up-to-date database of solutions to many of these and related problems for small-ish $n$.

Visualizing the solutions

I used the point layouts provided on Prof. Aichholzer’s website and wrote some Python code to visualize the results. As a warm-up, I started with the case $K_5$.

The order types are sorted by their rectilinear crossing number. The one with the most crossing-free Hamiltonian cycles is the first one. Here they are:

Six nodes

Here are the 16 order types for $K_6$:

Here are the 29 crossing-free Hamiltonian cycles for the maximal configuration:

Seven nodes

Here are the 135 order types for $K_7$. Interestingly, there are three different realizations for the minimal rectilinear crossing number of $9$, and the most symmetric one (the first one) is not the one with the most crossing-free Hamiltonian cycles!

And here are the 92 crossing-free Hamiltonian cycles for the maximal configuration:

More nodes

The solutions get too large to visualize beyond 7 nodes, but here are the results for $n \leq 10$, courtesy once again of Prof. Aichholzer’s webpage. “CFHC” stands for “crossing-free Hamiltonian cycles”.

Number of nodes	Order types	Max CFHC
3	1	1
4	2	3
5	3	8
6	16	29
7	135	92
8	3,315	339
9	158,817	1,282
10	14,309,547	4,994

The Python code I wrote to produce all the figures is available here.

Mismatched socks

This week’s Riddler Classic is a problem familiar to many…

I have $n$ pairs of socks in a drawer. Each pair is distinct from another and consists of two matching socks. Alas, I’m negligent when it comes to folding my laundry, and so the socks are not folded into pairs. This morning, fumbling around in the dark, I pull the socks out of the drawer, randomly and one at a time, until I have a matching pair of socks among the ones I’ve removed from the drawer.

On average, how many socks will I pull out of the drawer in order to get my first matching pair?

Here is my solution:
[Show Solution]

Computing the expectation

Define $T$ to be number of turns that the game lasts (this is a random variable). Define $q_k = \textbf{P}(T > k)$ to be the probability that the game has still not ended after $k$ draws. We can compute this explicitly. There are $\binom{2n}{k}$ possible (equally likely) ways of choosing $k$ socks among the $2n$ socks in the drawer. If there is no match, we must have drawn $k$ socks of different colors. There are $\binom{n}{k}$ ways to choose the $k$ colors and $2^k$ ways to pick the socks once the colors are chosen. Putting everything together and performing some algebraic manipulations, we obtain:
\[
q_k = \frac{\binom{n}{k} 2^k}{\binom{2n}{k}}
= \frac{n! \cdot (2n-k)! \cdot 2^k}{(2n)! \cdot (n-k)!}
= \frac{ \binom{2n-k}{n} 2^k }{\binom{2n}{n}}
\]Let’s also define $p_k = \textbf{P}(T=k)$ to be the probability that the game ends on precisely the $k^\text{th}$ draw. Note that $q_{k} = p_{k+1}+p_{k+2}+\cdots+p_{n+1}$, or alternatively, $p_k = q_{k-1}-q_k$. From the definition of expected value:
\[
\textbf{E}(T) = \sum_{k=1}^{n+1} k\,p_k = \sum_{k=1}^{n+1}k(q_{k-1}-q_k) = \sum_{k=0}^n q_k
\]This is a manifestation of the general fact that $\textbf{E}(T) = \sum_{k\ge 0}\textbf{P}(T > k)$. Make the change $k\mapsto n-k$ in the expectation:
\[
\textbf{E}(T) = \frac{\sum_{k=0}^n \binom{2n-k}{n} 2^k}{\binom{2n}{n}}
= \frac{2^n \sum_{k=0}^n \binom{n+k}{k} 2^{-k}}{\binom{2n}{n}}
\]Consider the sum in the numerator. Define:
\[
S_n = \sum_{k=0}^n \binom{n+k}{k} 2^{-k}
\]We can evaluate this sum recursively:
\begin{align}
S_{n+1} &= \sum_{k=0}^{n+1} \binom{n+k+1}{k} 2^{-k} \\
&= \sum_{k=0}^{n+1}\left[ \binom{n+k}{k} + \binom{n+k}{k-1} \right] 2^{-k} \\
&= \sum_{k=0}^{n+1} \binom{n+k}{k}2^{-k} + \sum_{k=0}^{n+1} \binom{n+k}{k-1} 2^{-k} \\
&= S_n + \binom{2n+1}{n+1}2^{-n-1} + \sum_{k=0}^{n} \binom{n+k+1}{k} 2^{-k-1} \\
&= S_n + \tfrac{1}{2}S_{n+1}
\end{align}Solving, we obtain $S_{n+1} = 2S_n$, and therefore conclude that $S_n = 2^n$. Substituting this result back into the formula we had above, we obtain:

$\displaystyle
\textbf{E}(T) = \frac{4^n}{\binom{2n}{n}}
$

To get a feel for how this expression varies with $n$, we can use Stirling’s approximation, which leads us to the approximation:
\[
\textbf{E}(T) \approx \sqrt{\pi n}
\]So the expected number of socks we’ll have to draw grows as the square root of the number of pairs. For example, when we have $n=10$ pairs of socks, $\textbf{E}(T) = \frac{262144}{46189}\approx 5.67546$. Meanwhile, the approximation yields $\sqrt{10\pi} \approx 5.60499$. The approximation gets better as $n$ gets larger. Here is a plot showing the true expected value and the approximation:

Computing the distribution

We have the expectation $\textbf{E}(T) = \frac{4^n}{\binom{2n}{n}} \approx \sqrt{\pi n}$. But what about the distribution of $T$ itself? Does it approach anything interesting as $n$ gets large? First, we need to compute the exact probability mass function, which is nothing other than $p_k$. We can do this from the recursion we derived earlier: $p_k = q_{k-1}-q_k$. This simplifies to:
\[
p_k = \frac{k-1}{n-k+1} \frac{\binom{2n-k}{n} 2^{k-1} }{\binom{2n}{n}}\qquad\text{for }k=1,2,\dots,n+1
\]We can plot these distributions along with their means to see what it looks like. Here are the distributions of $T$ for $n=\{10,20,50,100\}$.

To understand what this distribution looks like as $n$ gets very large, we have to do a bit of asymptotic analysis. This part is a little rough around the edges (read: not particularly rigorous), but it seems to give a reasonable answer. My reasoning was that as $n\to\infty$, $k$ is small compared to $n$ because the mean grows like $\sqrt{n}$. Therefore, we will assume $k \sim \sqrt{n}$.

Using Mathematica, we can look at how $\frac{p_k}{k-1}$ varies for small $k$. Using a series approximation about $k=0$ and $n=\infty$, we find that:
\[
\log\left( \frac{p_k}{k-1} \right) \approx \log(\tfrac{1}{2n}) + O(\tfrac{1}{n}) + \tfrac{5}{4n} k-\tfrac{1}{4n}k^2+O(k^3)
\]Therefore, we conclude that:
\[
p_k \approx \frac{k}{2n} e^{-\tfrac{k^2}{4n}}
\]This means that as $n\to\infty$, the distribution of stopping time $T$ approaches a Rayleigh distribution with parameter $\sigma = \sqrt{2n}$. This also gives the correct mean for large $n$, as the mean of the Rayleigh distribution is $\sigma\sqrt{\frac{\pi}{2}} = \sqrt{\pi n}$. For a comparison, here is the plot for $n=500$ along with the Rayleigh approximation:

Gift card puzzle

Here is a puzzle from the Riddler about gift cards:

You’ve won two gift cards, each loaded with 50 free drinks from your favorite coffee shop. The cards look identical, and because you’re not one for record-keeping, you randomly pick one of the cards to pay with each time you get a drink. One day, the clerk tells you that he can’t accept the card you presented to him because it doesn’t have any drink credits left on it.

What is the probability that the other card still has free drinks on it? How many free drinks can you expect are still available?

Here is my solution:
[Show Solution]

Let’s suppose each card starts with $n$ drinks. There are three ways the sequence of buying drinks can end:

The first card gets maxed out and the second card has $k$ drinks remaining with $k\ge 1$. This means we purchased a total of $2n-k$ drinks and $n$ of them ended up on the first card. Then, we tried to purchase one additional drink on the first card and the buying stopped. The probability of this occurring is: $(\tfrac{1}{2})^{2n-k+1}\binom{2n-k}{n}$.
both cards get maxed out and then we try to buy one more drink on either card. This means we purchased a total of $2n$ drinks and $n$ of them ended up on the first card. The probability of this occurring is $(\tfrac{1}{2})^{2n}\binom{2n}{n}$.
The second card gets maxed out and the first card has $k$ drinks remaining with $k\ge 1$. This is exactly analogous to the first case, and the probability of this occurring is again: $(\tfrac{1}{2})^{2n-k+1}\binom{2n-k}{n}$.

Putting all of this together, we end up with a tidy formula for $p(n,k)$, the probability that when the game ends, the other card has exactly $k$ drinks remaining on it:

$\displaystyle
p(n,k) = \frac{1}{2^{2n-k}} \binom{2n-k}{n}
$

To double-check that this is a valid probability mass function, we should have $\sum_{k=0}^n p(n,k) = 1$. This is indeed the case, but it’s rather challenging to verify. Here is a link to the WolframAlpha computation. Here is plot of the probability mass function for $n=50$.

We can now answer the first question: the probability that the other card still has drinks on it is one minus the probability that there are no drinks left, i.e. $1-p(n,0)$. This evaluates to:

$\displaystyle
\mathrm{Prob}\Bigl(\begin{smallmatrix}\text{there are drinks left}\\\text{on the other card}\end{smallmatrix}\Bigr)
= 1-\frac{1}{4^n} \binom{2n}{n} \approx 1-\frac{1}{\sqrt{\pi n}}
$

This makes sense; the only way there can be no drinks left is if we purchased $2n$ drinks, and we were lucky to use each card exactly $n$ times. We would expect this probability to get smaller as $n$ gets larger. The approximation I used above comes from Stirling-like approximations for binomial coefficients (see here). When $n=50$ as in the problem statement, the probability evaluates to $92.04\%$. Here is a plot showing how the probability of there being drinks on the other card slowly tends to $1$ as $n$ gets larger:

Now let’s address the second question: what is the expected number of drinks on the other card? We will compute the expected value directly from the probability mass function: $E_n = \sum_{k=0}^n k\, p(n,k)$. We find:

$\displaystyle
\mathbb{E}\Bigl(\begin{smallmatrix}\text{number of drinks left}\\\text{on the other card}\end{smallmatrix}\Bigr)
= \frac{2n+1}{4^n}\binom{2n}{n}-1 \approx \frac{2n+1}{\sqrt{\pi n}}-1
$

Again, the derivation is challenging so I omitted it, but here is the WolframAlpha link. Here is a plot of this function as it changes with $n$ along with the approximation (which is quite good!)

When $n=50$ as in the problem statement, the expected number of drinks remaining on the other card is $7.0385$. This corresponds to the expected value of the probability distribution plotted in the first image.

Elf music

This holiday-themed Riddler problem is about probability:

In Santa’s workshop, elves make toys during a shift each day. On the overhead radio, Christmas music plays, with a program randomly selecting songs from a large playlist.

During any given shift, the elves hear 100 songs. A cranky elf named Cranky has taken to throwing snowballs at everyone if he hears the same song twice. This has happened during about half of the shifts. One day, a mathematically inclined elf named Mathy tires of Cranky’s sodden outbursts. So Mathy decides to use what he knows to figure out how large Santa’s playlist actually is.

Help Mathy out: How large is Santa’s playlist?

Here is my solution:
[Show Solution]

Hand sort

A card-rearranging problem on the Riddler blog. Here it goes:

You play so many card games that you’ve developed a very specific organizational obsession. When you’re dealt your hand, you want to organize it such that the cards of a given suit are grouped together and, if possible, such that no suited groups of the same color are adjacent. (Numbers don’t matter to you.) Moreover, when you receive your randomly ordered hand, you want to achieve this organization with a single motion, moving only one adjacent block of cards to some other position in your hand, maintaining the original order of that block and other cards, except for that one move.

Suppose you’re playing pitch, in which a hand has six cards. What are the odds that you can accomplish your obsessive goal? What about for another game, where a hand has N cards, somewhere between 1 and 13?

Here is my solution:
[Show Solution]

I was unable to find an analytic or closed-form expression for the solution to this problem, so I’ll present an old fashioned brute-force solution. The idea is simple:

Enumerate all possible hands
For each hand, enumerate all possible moves that consist of moving some group of adjacent cards to somewhere else in the hand.
If any such move succeeds in sorting the hand, mark the hand as “good”.
Once we’re done, count all “good” hands and divide that by the total number of hands

Simple, right? I’d like to point out is that this is not an approximate solution; there is no simulation involved. I’m counting all the possible scenarios, so this approach produces the exact answer. The only problem is that there are a lot of hands to check, and the bigger the hands get, the longer it takes to check them, since there are more possible moves. Successfully solving this problem requires coding it up in a way that a computer can find the solution in a reasonable amount of time. In order to make this work, I used several tricks to reduce the computation required:

We don’t have to enumerate all possible hands because only the suits matter, but we do have to be careful about how we count. We begin with a full deck of cards (52 distinct cards). Let’s suppose we want to count hands of size 2. There are $52\cdot 51 = 2652$ possible hands. But since numbers don’t matter, we can remove the number and just look at suits. There are then 16 possible hands:
\[
\begin{array}{cccc}
♠♠ & ♠♣ & ♠\color{red}{♥} & ♠\color{red}{♦} \\
♣♠ & ♣♣ & ♣\color{red}{♥} & ♣\color{red}{♦} \\
\color{red}{♥}♠ & \color{red}{♥}♣ & \color{red}{♥}\color{red}{♥} & \color{red}{♥}\color{red}{♦} \\
\color{red}{♦}♠ & \color{red}{♦}♣ & \color{red}{♦}\color{red}{♥} & \color{red}{♦}\color{red}{♦}
\end{array}
\]suppose for example that the hands with different suits but same color are not sortable. There are four such hands: $\{ ♠♣, ♣♠, \color{red}{♥}\color{red}{♦}, \color{red}{♦}\color{red}{♥} \}$. Then we might conclude that $\tfrac{4}{16} = 0.25$ of hands are not sortable. But this would be wrong! (thanks to Guy Moore for pointing this out). The reason is that these 16 hands don’t all occur with equal probability. There are $13\cdot 13 = 169$ ways of picking two cards of different suits, but only $13\cdot 12 = 156$ ways of picking two cards of the same suit! So each type of card should be weighted by its likelihood of occurrence. This means the probability of picking an unsortable hand should really be:
\[
\frac{4\cdot 169}{4\cdot 156 + 12\cdot 169} = \frac{13}{51} \approx 0.2549
\]So we can collapse the large number of possible hands to this more manageable size as long as we compensate by appropriately weighting the different items. In combinatorics, this collapsed group of cards (where we care about the order and the suit, but not the number of the card) is an example of a multiset permutation.

Note: If we don’t weigh the hands according to likelihood (i.e. assume all hands are equally likely), this is equivalent to assuming the deck has infinitely many cards (but the same number of cards in each suit).
Adjacent cards of the same suit can be collapsed to a single card. This is because there is never any benefit to moving a block of cards if it splits an existing contiguous block. So for example:
\[
\{♠,♠,\color{red}{♥},\color{red}{♥},\color{red}{♦}\} \implies \{♠,\color{red}{♥},\color{red}{♦}\}.
\]This greatly reduces the computations because if two different hands collapse to the same reduced hand, we only need to do the work for one of them! In computer programming, this technique of storing the values of computations so that you don’t end up computing the same thing many times is called memoization.
If a hand consists of $N$ cards (after we collapse adjacent suit repetitions as described above), then there are $\binom{N+1}{3}$ possible moves we can make. This is because every move is characterized by three locations. I’ll illustrate this with an example. Say our hand looks like this: $\{♠,\color{red}{♥},\color{red}{♦},♠,\color{red}{♥},♣\}$ and we want to sort it by moving cards 4 and 5 and inserting them between cards 1 and 2. In other words, we want to perform:
\[
\{♠,\color{red}{♥},\color{red}{♦},(♠,\color{red}{♥}),♣\} \implies \{♠,(♠,\color{red}{♥}),\color{red}{♥},\color{red}{♦},♣\}
\implies \{♠,\color{red}{♥},\color{red}{♦},♣\}
\]where we did one final collapse at the end. This transformation can be represented by inserting three “bars” between cards as follows. Then the swap simply consists of exchanging the cards between the two pairs of bars! Here is the diagram:
\[
\{\,♠\,\,|\underbrace{\color{red}{♥}\,\,\color{red}{♦}}_\text{swap this}|\underbrace{♠\,\,\color{red}{♥}}_\text{with this}|\,\,♣\,\}
\implies \{♠, ♠, \color{red}{♥},\color{red}{♥},\color{red}{♦},♣\}
\]Each triplet of bars corresponds to a possible swap, and we can insert bars in $N+1$ possible spots (in between each card and on either end).
If the hand has 8 or more cards in it after we perform the collapse, then it’s impossible to sort it in one move. This is because our final hand can have at most 4 groups of cards in it, and a single move can cause at most 3 collapses. With 8 cards, we can get down to 5 but not 4. Here is an example of a 7-card hand that can be sorted in one move:
\[
\{\,♠\,|\,\color{red}{♥}\,♣\,|\,♠\,\color{red}{♥}\,|\,♣\,\color{red}{♦}\,\}
\implies \{\,♠\,♠\,\color{red}{♥}\,\color{red}{♥}\,♣\,♣\,\color{red}{♦}\,\}
\implies \{\,♠\,\color{red}{♥}\,♣\,\color{red}{♦}\,\}
\]

Numerical results

There was a bit of confusion as to how to interpret the meaning of the requirement “the cards of a given suit are grouped together and, if possible, such that no suited groups of the same color are adjacent”. One way to interpret this statement is that we only require cards of a different suit but same color to be separated if it’s actually possible to separate them. For example, the two-card hand $\{\color{red}{♥}\,\color{red}{♦}\}$ counts as being sorted because there are no spades or clubs that can be placed between the heart and the diamond. In this scenario, here are the results:

Note that hands of up to size $N=4$ are always sortable in one move. The exact solution for the case $N=6$ using this interpretation is $\tfrac{51083}{83895} \approx 60.8892\%$. Another way to interpret the requirement is that cards of a different suit but same color can never be adjacent. This means that hands like $\{\color{red}{♥}\,\color{red}{♦}\}$ are never sortable no matter what you do. In this case, we get the result:

It’s interesting to note that the probability of a “sortable” hand in this interpretation actually reaches a maximum when $N=4$ and then decreases afterwards. The exact solution for the case $N=6$ using this interpretation is $\tfrac{1735996}{2936325} \approx 59.1214\%$.

I wrote my code in Julia, and you can view my notebook here. The computation gets slower in an exponential fashion as we increase $N$. The cases $N \leq 8$ take on the order of seconds, but computation time quadruples every time $N$ increases by 1. The last case, $N=13$, took about 40 minutes! While there is a significant amount of computation to do, there is also significant computational overhead due to using exact arithmetic instead of making approximations. For this, I used Julia’s BigInt datatype. So the solutions I obtained were exact. For example, the probability of a sortable hand for $N=13$ (using the first interpretation) turns out to be:
\[
\tfrac{10954472929065768960}{3954242643911239680000} = \tfrac{30785713171}{11112737293000} \approx 0.27703\%
\]Using approximations the whole way leads to a computation time of less than 10 minutes for the most complicated case.

Sniff out the spies

This interesting problem appeared on the Riddler blog. Here it goes:

There are N agents and K of them are spies. Your job is to identify all the spies. You can send a given number of agents to a “retreat” on a remote island. If all K spies are present at the retreat, they will meet to strategize. If even one spy is missing, this spy meeting will not take place. The only information you get from a retreat is whether or not the spy meeting happened. You can send as many agents as you like to the retreat, and the retreat can happen as many times as needed. You know the values of N and K.

What’s the minimum number of retreats needed to guarantee you can identify all K spies? If each retreat costs \$1,000 per person, what is the total cost to identify all K spies?

To begin with, let’s assume you know that N = 1,024 and K = 17.

Here is my solution for $K=1$:
[Show Solution]

The case $k=1$

Let’s warm up with the simple case $k=1$. We’ll call $r_1(n)$ the minimum number of retreats needed to identify a single spy in a group of $n$ agents. If we choose to send $m$ of the $n$ agents on a retreat, then either a meeting took place, which means the spy is among those $m$, or no meeting took place, so the spy must be among the $n-m$ that didn’t go on the retreat. Depending on the result, we’ll either need $r_1(m)$ or $r_1(n-m)$ more retreats. We can therefore write this little (dynamic programming) recursion:
\begin{align}
r_1(1) &= 0 \\
r_1(n) &= \underset{m\in\{1,\dots,n-1\}}{\text{minimize}}\,\, 1+\text{max}\{ r_1(m), r_1(n-m) \}
\qquad\text{for }n=2,3,\dots
\end{align}The reason for the “max” is that we want a guarantee that this number of retreats will always find the spy. Therefore, we assume the spy is wherever they need to be so that it takes as long as possible to find them. From here, it’s clear that we should always split in half. Therefore,
\begin{align}
r_1(1) = 0\quad\text{and}\quad r_1(n) = 1 + r_1\bigl(\lceil \tfrac{n}{2} \rceil\bigr)
\quad\text{for }n=2,3,\dots
\end{align}Where $\lceil x \rceil$ means that we round $x$ up to the nearest integer. By inspection, we can solve this recursion and we find that:

$\displaystyle
\text{number of retreats} = r_1(n) = \lceil \log_2(n) \rceil
$

What about the cost? We can simply count the total number of retreat attendees, which we’ll call $a_1(n)$. When we split $n$ in half, we can choose to send either $\lceil \tfrac{n}{2} \rceil$ or $\lfloor \tfrac{n}{2} \rfloor$ (round up or round down) to the retreat. Both give us the same information, so we’ll choose to send fewer agents. The number of attendees therefore satisfies:
\begin{align}
a_1(1) = 0\quad\text{and}\quad a_1(n) = \lfloor \tfrac{n}{2} \rfloor + a_1\bigl(\lceil \tfrac{n}{2} \rceil\bigr)
\quad\text{for }n=2,3,\dots
\end{align}Again, by inspection, we find that the number of attendees is:

$\displaystyle
\text{number of attendees} = a_1(n) = n-1
$

Note that we could have achieved the same total cost by sending the agents one-by-one on individual retreats. By the time we’ve sent all but one agent, we must know who the spy is. However, we were asked to minimize the number of retreats, so it would be wasteful to have this many retreats.

And here is a partial solution for $K \gt 1$:
[Show Solution]

The case $k\gt 1$

The case $k=1$ was all about finding one spy. There were $n$ possibilities (each of the $n$ agents could be the spy) and each time we had a retreat, we received one bit of information (yes/no). The best we could do was use that bit to cut our possibilities by a factor of two, and this was achievable by sending half of the agents on the retreat each time.

If $k \gt 1$, we are tasked with finding $k$ spies, and there are now $\binom{n}{k}$ possibilities (choosing $k$ spies out of $n$ agents). Again, the logic is the same: the best we can hope for with one bit of information is to reduce the possibilities by a factor of two. If we call $r_k(n)$ the minimum number of retreats required to identify the spies with certainty, then this argument yields the bound:

$\displaystyle
\text{number of retreats} = r_k(n) \ge \left\lceil \log_2\binom{n}{k} \right\rceil
$

If $n=1024$ and $k=17$, this yields the bound $r_{17}(1024) \ge 122$. The question then becomes: how close can we actually come to reaching this bound? One possible approach is to be greedy: we keep track of all possible remaining subsets, then we consider all possible retreat arrangements and use the one that reduces the number of remaining subsets by as much as possible (which can never be more than than a factor of two). We continue in this fashion until we have reduced the number of possible subsets to one, and that must be our set of spies.

While this greedy approach would likely be optimal, it’s computationally intractable. There are $\binom{1024}{17} \approx 3.68\times 10^{34}$ possible subsets of spies and there are $2^{1024} \approx 1.8\times 10^{308}$ possible retreats we could arrange at every step. So there is no hope of computing the optimal greedy strategy and seeing how well it performs compared to our lower bound.

We can think of this as a game, where I’m trying to guess the spies and the adversary is trying to arrange the spies such that it’s as difficult as possible for me to guess. So this becomes a philosophical question: if I pick a split such that there is a slight advantage for my adversary to say “yes, there was a meeting”, then as long as they keep saying “yes”, the solution is recursive and easily computable. But if my adversary knows that the problem gets very hard whenever they say “no, there was no meeting”, they they might just do something “suboptimal” with the goal of running me out of memory.

Suboptimal heuristic

The greedy heuristic seems out of reach, so here is another heuristic, which was truly a team effort — discussions with my colleague Alberto Del Pia, comments on this post by Jim Crimmins, Adam, and Jason Weisman, as well as a little bit of effort on my end. The idea is to split the $n$ agents into $m$ groups of roughly equal size, where $k+1 \le m \le n$. Let’s number the groups $1,\dots,m$. Here is the plan:

Take
Group 1 stays home, all other groups go on a retreat.
- If there was a meeting, then Group 1 has no spies in it.
- If there was no meeting, then Group 1 has at least one spy in it.
Return to step 1, but this time, Group 2 stays home and all other groups go on a retreat.
Keep going in this fashion until we have identified all groups with at least one spy in them. There can be at most $k$ such groups. Merge these groups (this becomes our new $n$), discard all other groups, and repeat the entire process with the new smaller group: picking $m$, dividing into groups, etc.

Note that we do not need to try all $m$ possible retreats in each round! As soon as there are $k$ retreats with “no meeting”, then we can stop and immediately proceed to the next round, since we now know which $k$ groups contain spies. In the worst case, this won’t happen. We will always be forced to have as many retreats as possible. By the time we have done $m-1$ retreats, two things can happen:

We have identified $k$ spies, in which case we can carry over our $k$ spy-containing groups to the next round immediately.
We have identified $k-1$ spies, in which case we can just carry over the last group to the next round without testing it.

In either case, we only require $m-1$ retreats and we always send $k$ groups to the next round. So there is never any need to have all $m$ retreats in a given round. To see what happens at each round, note that if we divide $n$ into $m$ groups, then to figure out the group sizes, let $q$ and $r$ be the quotient and remainder, respectively, when we divide $n$ by $m$. So $n = qm + r$, with $0\le r\le m-1$. Then $n$ will be divided into $m$ groups as follows:
\[
n = \underbrace{q+\cdots+q}_{m-r} + \underbrace{(q+1) + \cdots + (q+1)}_{r}
\]But which groups carry over to the next round? Since we get to choose the order in which we schedule the retreats, and not all groups are of equal size, we can ensure that the last group (corresponding to the last retreat that never happens) is as small as possible. But it’s also possible that our $(m-1)^\text{th}$ group gets sent to the next round instead. So our best bet is to keep the two smallest groups for last. Ultimately, the worst case will see the $k-1$ largest groups and the $2^\text{nd}$ smallest group sent to the next round. Let’s call $g(n,m)$ this number. It’s somewhat annoying to compute $g(m,n)$, since it depends on the relative size of $k$ and $r$, but it’s approximately equal to $kn/m$. Here is what the recursion looks like:
\[
r_k(k) = 0\qquad\text{and}\qquad
r_k(n) = \underset{m\in\{k+1,\dots,n\}}{\text{minimize}}\,\, \left( m-1 + r_k( g(n,m) ) \right)
\]The recursion can be solved in a straightforward manner using recursion. Here is some Julia code along with the full computation of $g(n,m)$ that does the job:

nmax = 1024
k = 17

memo = -1 + zeros(Int,nmax)

# if n agents are divided into m groups, pick k-1 largest and 2nd smallest
function g(n,m,k)
    # n = q*(m-r) + (q+1)*r
    q,r = floor(Int,n/m), rem(n,m)
    if m == k+1  # we must pick the k largest in this case
        (r >= k ? k*(q+1) : r*(q+1) + (k-r)*q)
    else  # pick the k-1 largest and the 2nd smallest in this case
        (r >= k-1 ? (k-1)*(q+1) : r*(q+1) + (k-1-r)*q) + (m-r >= 2 ? q : q+1)
    end
end

function rk(n)
    if memo[n] != -1
        memo[n]
    else
        memo[n] = ( n == k ? 0 : minimum([m-1+rk(g(n,m)) for m=k+1:n]) )
    end
end
        
rk(1024)

In the first round, we have $n=1024$ suspects. The optimal thing to do is to choose $m=43$ groups, leaving us with $407$ suspects. Then we choose $51$ groups to get down to $136$ suspects. Then choose $46$ groups again to get down to $50$ suspects. Finally, we choose $50$ groups and we’re down to $17$ suspects, which must be our spies. So the total is $(43-1)+(51-1)+(46-1)+(50-1)=186$ total retreats. Interestingly, if we use the approximation $g(n,m) \approx \lceil \tfrac{kn}{m} \rceil$, we also get the result of $186$. Together with our previously derived lower bound, we obtain:

$\displaystyle
\text{number of retreats}:\quad 122 \leq r_{17}(1024) \leq 186
$

The lower bound means that it’s impossible for any strategy to do better than this, but there may not be any strategies that are this good. The upper bound says it’s definitely possible to do this well using a particular strategy, but other strategies might do better. In this case, the ratio between our upper and lower bounds is about 1.52. This sort of ratio is used in approximation theory to quantify the quality of a proposed solution when the problem is hard to solve exactly.

How about the cost? We will approximate. For each round, in the worst case, we have $m$ retreats, where $k$ of them have meetings and $m-k$ do not. Whenever there is a meeting, the group that stayed home can be excluded forever after (since we know it contains no spies), so we don’t need to send that group on subsequent retreats. The most expensive thing that can happen is that our $k$ retreats with meetings come last. So the total number of attendees will be approximately:
\begin{align}
a(n,m) &\approx (m-1-k)\left(n-\tfrac{n}{m}\right) + \sum_{j=1}^k \left(n-j\tfrac{n}{m}\right)\\
&\approx \frac{n \left(k-k^2+2 (m-1)^2\right)}{2 m}
\end{align}So if we end up choosing $m$ groups when there are $n$ attendees, we should add $a(n,m)$ to our running count of attendees. Incorporating this using the solution found above, we obtain the result that the strategy will require sending about $65484$ attendees to retreats. At a cost of \$1,000 per attendee, this means it will cost at most \$65.48 million to find all the spies.

For the curious, here is a plot showing the maximum number of retreats required to sniff out 17 spies, as a function of the total number of agents:

As we can see, the upper and lower bounds start close together but end up spreading apart as $n$ increases.

Minimizing cost instead?

The problem statement asked us to minimize the number of retreats, which led us to 186 retreats at a cost of \$65.48 million. If instead we attempted to minimize dollars using a similar strategy of dividing into groups and keeping one group home, we can formulate a similar recursion to the one above, except this time we perform the minimization over cost rather than retreats. The result in this case is 220 retreats at a cost of \$54.46 million. This makes sense. We can lower the cost if we want, but only at the expense of adding more retreats.

Even better!

It turns out the upper bound can be reduced to $131$ by instead singling out one spy at a time! This turns out to have a much better worst-case performance than splitting the remaining agents into equally-sized groups as I described in my solution. As one might expect, the worst-case cost in dollars goes up dramatically. The solution, due to Tim Black, can be found here. It still remains to be seen whether there exists a simple strategy that can further close the gap between the lower bound of 122 and the new upper bound of 131.