<p>This problem is a twist on the classical Coupon Collector Problem. In the classical problem, there are $n$ different coupons in a basket. For one dollar, we get to pick a coupon at random from the basket. We then return the coupon to the basket. We keep doing this until we’ve seen each of the $n$ different coupons at least once. How many dollars will we spend on average?</p>

<h3>Simpler version: packs of one</h3>
<p>Let’s start by solving the standard coupon collector problem. First, we’ll need the following fact: if a coin comes up Heads with probability $p$, then on average we have to flip it $1/p$ times to obtain our first Heads (counting the flip that lands Heads). To see why, let $x$ be the expected number of flips. If the first flip is Heads (probability $p$), then we have performed 1 flip and we stop. If the first flip is Tails (probability $1-p$), then we have performed 1 flip but we will need to perform $x$ more on average. Mathematically, this amounts to:
\[
x = p\cdot 1 + (1-p)\cdot (1+x)
\]
Solving for $x$ yields $x=1/p$. You can also solve this using recursion, as in my solution to the Monsters’ gems puzzle.</p>
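<p>For completeness, the algebra behind $x=1/p$ is a single rearrangement of the equation above and adds nothing beyond the original argument:
\begin{align}
x &= p + (1-p)(1+x) \\
  &= 1 + (1-p)\,x \\
\implies\; p\,x &= 1 \quad\implies\quad x = \frac{1}{p}
\end{align}
For example, a fair coin ($p=\tfrac{1}{2}$) needs 2 flips on average.</p>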
<p>We can think of the coupon collection problem by asking how many draws are required before we pick our next <em>new</em> coupon. Our first coupon is always new (probability 1) and takes 1 draw. The probability that a given draw yields our second new coupon is $\tfrac{n-1}{n}$, since any of the $n-1$ remaining coupons will do, so this takes on average $\tfrac{n}{n-1}$ draws. Next, our probability is $\tfrac{n-2}{n}$, which takes on average $\tfrac{n}{n-2}$ draws, and so on. By linearity of expectation, the expected number of draws $C_n$ needed to pick all $n$ coupons is:
\begin{align}
C_n
&= \frac{n}{n} + \frac{n}{n-1} + \frac{n}{n-2} + \cdots + \frac{n}{1} \\
&= n \left( 1 + \frac{1}{2} + \cdots + \frac{1}{n} \right) \\
&\approx n (\log n + \gamma)
\end{align}
where $\gamma \approx 0.5772$ is the Euler-Mascheroni constant. The relative error of the approximation vanishes as $n\to\infty$, while the absolute error tends to $\tfrac{1}{2}$. This is a harmonic sum and it has surfaced in several past problems (bears, camels, dwarfs).</p>
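<p>As a quick numerical check of the harmonic-sum formula, here is a minimal Julia sketch. The names <code>coupon_exact</code>, <code>coupon_approx</code>, and <code>EULER_GAMMA</code> are chosen here for illustration and are not from any library. The constant is hard-coded so the sketch does not depend on a particular Julia version.</p>

```julia
# Euler-Mascheroni constant, hard-coded (illustrative name, not a library constant)
const EULER_GAMMA = 0.5772156649015329

# Exact expectation: C_n = n * (1 + 1/2 + ... + 1/n)
coupon_exact(n) = n * sum(1 / k for k in 1:n)

# Asymptotic approximation: n * (log(n) + gamma)
coupon_approx(n) = n * (log(n) + EULER_GAMMA)

# Print both values side by side for a few set sizes
for n in (10, 100, 1000)
    println("n = ", n, ": exact = ", coupon_exact(n), ", approx = ", coupon_approx(n))
end
```

<p>For these values of $n$, the two columns agree to within about half a draw, so the relative error shrinks as $n$ grows.</p>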

<h3>Full version: multi-packs</h3>
<p>Let’s now assume that we draw $m$ coupons at a time from the basket, which contains $n$ distinct coupons (the counting below takes the $m$ coupons in a single draw to be all different). This is precisely the original problem (with $m=10$ and $n=100$). Let’s call $X_k$ the expected number of additional draws required to obtain all $n$ coupons given that we have already collected $k$ distinct coupons. We could get lucky and obtain several new coupons in our draw, or we could strike out and receive all duplicates. In general, there are $\binom{k}{m-i}\binom{n-k}{i}$ ways to obtain exactly $i$ new coupons (out of $\binom{n}{m}$ total possible draws), since we can choose $i$ out of the $n-k$ possible new coupons remaining and $m-i$ out of the $k$ already-seen coupons. Upon drawing our $i$ new coupons, we will have to make $X_{k+i}$ further draws on average. We therefore have the recursion:
\[
X_k = 1 + \sum_{i=0}^m \frac{\binom{k}{m-i}\binom{n-k}{i}}{\binom{n}{m}} X_{k+i}
\]
Terms with $i > n-k$ vanish because $\binom{n-k}{i}=0$. Noting that $X_k$ occurs on both sides (the $i=0$ term), we can solve for it and obtain:
\[
X_k = \frac{1}{\binom{n}{m}-\binom{k}{m}}\left( \binom{n}{m} + \sum_{i=1}^m \binom{k}{m-i}\binom{n-k}{i} X_{k+i} \right)
\]
Now each $X_k$ depends only on $X_{k+1},X_{k+2},\dots$, and we have the terminal condition $X_n = 0$. So we can work backwards, first evaluating $X_{n-1}$, then $X_{n-2}$, and so on until we arrive at $X_0$, which is the final answer we seek. As a side note, the fact that the probabilities above sum to $1$ is a consequence of Vandermonde’s Identity, which I discussed in my post on double counting.</p>
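<p>To see the backward recursion in action, here is a small worked case chosen purely for illustration: $n=3$ coupons drawn in packs of $m=2$, so $\binom{n}{m}=3$. Starting from $X_3=0$ and dropping terms whose binomial coefficients are zero:
\begin{align}
X_2 &= \frac{1}{3-\binom{2}{2}}\left( 3 + \binom{2}{1}\binom{1}{1}X_3 \right) = \frac{3}{2} \\
X_1 &= \frac{1}{3-\binom{1}{2}}\left( 3 + \binom{1}{1}\binom{2}{1}X_2 + \binom{1}{0}\binom{2}{2}X_3 \right) = \frac{3+3}{3} = 2 \\
X_0 &= \frac{1}{3-\binom{0}{2}}\left( 3 + \binom{0}{0}\binom{3}{2}X_2 \right) = \frac{3+\tfrac{9}{2}}{3} = \frac{5}{2}
\end{align}
This matches a direct argument: the first pack always yields two new coupons, and each later pack contains the one missing coupon with probability $\tfrac{2}{3}$, so we expect $1 + \tfrac{3}{2} = \tfrac{5}{2}$ packs in total.</p>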
<p>There doesn’t appear to be a nice closed-form expression for $X_0$. However, it’s straightforward to evaluate the recursion numerically, as long as care is taken to prevent the binomial coefficients from causing integer overflow when $n$ and $m$ get large. We can also approximate the solution decently well: the $m=1$ case (one card per pack) requires approximately $n(\log(n)+\gamma)$ packs to collect all cards, so one might expect that with $m$ cards per pack, one could collect all cards roughly $m$ times faster. This turns out to be a good approximation!</p>
<p>In the plot below, I computed the exact expectation and also showed the approximation $\tfrac{n}{m}(\log(n)+\gamma)$.</p>
<p>[Plot: exact expected number of packs compared with the approximation $\tfrac{n}{m}(\log(n)+\gamma)$]</p>
<h3>Numerical solutions</h3>
<p>The original problem statement asked about the case of $m=10$ cards per pack, with either $n=100$ or $n=300$ total different cards. Here is Julia code that evaluates the solution rather quickly:</p>
<pre>
# Julia 0.6.4
# Binomial coefficient on BigInt (arbitrary precision) to avoid integer overflow for large n
binom(n,m) = binomial(big(n),m)

# Expected number of packs of m cards needed to collect all n distinct cards
function expected_draws(n,m)
    # X[k+1] stores X_k; entries from X_n onward stay zero (X_n = 0, plus padding for k+i > n)
    X = zeros(n+m+1)
    for k = n-1:-1:0
        denom = binom(n,m) - binom(k,m)
        q0 = binom(n,m)/denom
        q = [ binom(k,m-i)*binom(n-k,i)/denom for i in 1:m ]
        X[k+1] = q0 + sum(q .* X[k+2:k+1+m])
    end
    X[1]
end

println("100 cards, packs of 10: Number of packs = ", expected_draws(100,10))
println("300 cards, packs of 10: Number of draws = ", expected_draws(300,10))
</pre>
<p>The output of the above code is:</p>
<pre>
100 cards, packs of 10: Number of packs = 49.94456605666412
300 cards, packs of 10: Number of draws = 186.0851712198894
</pre>
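<p>As an independent sanity check on the recursion, one can also estimate the answer by simulation. The following is only a sketch under stated assumptions. It assumes Julia 1.x, where <code>randperm</code> lives in the <code>Random</code> standard library; on 0.6.4 the <code>using Random</code> line would likely need to be dropped. It also assumes each pack holds $m</code> distinct cards chosen uniformly at random. The names <code>simulate_packs</code> and <code>estimate_packs</code> and the trial count are illustrative choices.</p>

```julia
using Random  # standard library in Julia 1.x; provides randperm

# Simulate one collector: open packs of m distinct cards until all n cards are seen.
function simulate_packs(n, m)
    seen = falses(n)   # seen[c] is true once card c has been collected
    nseen = 0          # number of distinct cards collected so far
    packs = 0          # number of packs opened so far
    while nseen < n
        packs += 1
        # Assumption: a pack is a uniformly random set of m distinct cards
        for c in randperm(n)[1:m]
            if !seen[c]
                seen[c] = true
                nseen += 1
            end
        end
    end
    return packs
end

# Average over many simulated collectors (the default trial count is arbitrary)
estimate_packs(n, m; trials = 2000) = sum(simulate_packs(n, m) for _ in 1:trials) / trials

println("Simulated average for n = 100, m = 10: ", estimate_packs(100, 10))
```

<p>Up to sampling noise, a run should land near the exact value of about 49.9 packs printed above for the 100-card set.</p>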
<p>Therefore, it requires about 50 packs (5 weeks’ allowance) on average to collect all cards in the 100-card set, and about 186 packs (18.6 weeks’ allowance) for the 300-card set. If we use the approximation $\tfrac{n}{m}(\log(n)+\gamma)$, we come pretty close to the exact answer:</p>
<pre>
100/10 * (log(100) + γ) = 51.8238585088962
300/10 * (log(300) + γ) = 188.429944186732
</pre>
</div>
</body>