Pick a positive integer at random. What is the chance of it being 100? What or the odds that it is even? What is the likelihood that it is prime?

Probability distribution for s=1.1 (red), s=1.01 (blue) and s=1.001 (black).
Since the set of natural numbers is infinite, there are difficulties in assigning probabilities to subsets of
. We require the probability of the full set to be unity:
. We also require that the probability function should be additive:
for disjoint sets
and
.
We could try something like
but the denominator — and perhaps also the numerator — is infinite. So, we could try
But now the probability for finite sets and, in particular, for any specific number, is zero. So we cannot have additivity:
Density to the Rescue?
The density of a subset of
can be defined in various ways. If the elements of
are arranged in ascending order, we can define
This gives a density of for the set of even numbers, but it gives a zero density for any finite set. It also gives zero density for many infinite sets, such as the set of squares and the set of prime numbers. So, this will not do.
Another approach is to define probabilities directly, weighting smaller numbers more than larger ones. After all, we may argue that a smaller number is more likely to be chosen in a `random’ selection than a larger one.
We could select a convergent series like that sums to unity and define
But the chances of an even moderately large number seem unreasonably small: . Moreover, the chances of a randomly chosen number being even or odd are
which hardly gels with our intuition. We need a series that converges much more slowly than the geometric series.
The Euler Zeta-Function
As is well known, the harmonic series diverges:
On the other hand, the Basel series converges:
If we reduce the index from 2 towards 1, we get a series whose convergence is increasingly slow. It is Euler’s zeta-function, the real case of the Riemann zeta-function:
Now we can normalise the series, dividing each term by , and define
This is well-defined for . It allows us to assign a probability to any subset of the natural numbers. Moreover,
. Of course, the probability values are sensitive to
.
As approaches 1, the distribution becomes increasingly uniform (see Figure at top). For
, the chances of picking a number less than one million are
For , this is reduced to
. But for infinite sets, things are better. Here are some results for decreasing
, calculated with the first million terms
We see that the probabilities for both even and odd numbers are approaching 50%, but the rate of approach is painfully slow. It is possible to use asymptotic results to obtain estimates much more efficiently.
Uniform Probability and Surreals
If we argue that all numbers are `equally likely’, we should choose a uniform distribution. But, since we insist that , this works only for a bounded set of numbers:
We can consider what happens as : then,
for all
.
But what about surreal numbers? Let us return to our initial thought:
The cardinality of is the surreal number
, so this is
For a singleton, , we have
. And we have additivity:
If we now assume that is an even number, it follows that
Isn’t that nice!
Sources
Diaconis, Persi and Brian Skyrms, 2018: Ten Great Ideas About Chance. Princeton Univ. Press, 255 pages [See Chapter 5].
The Root of Infinity: it’s Surreal. Post on this blog.