Skip to main content

Pots, balls and the Reverend Bayes

I'm reading a book called Bayes' Rule by James V. Stone for review, which has reminded me of the delightful case of the mathematician's coloured balls. (Mathematicians often have cases of coloured balls. Don't ask me why.)

This is a thought experiment that helps illustrate why we have problems dealing with uncertainty and probability.

Imagine I've got a jar with 50 white balls and 50 black balls in it. I take out a ball but don't look at it. What's the chance that this ball is black?

I hope you said 50% or 50:50 or 1/2 or 0.5 - all ways of saying that it has equal chances of being either white or black. With no further information that's the only sensible assumption.

Now keep that ball to one side, still not looking at it. You pull out another ball and you do look at this one. (Mathematicians know how to have a good time.) It's white.

Now what's the chance that the first ball was black?

You might be very sensibly drawn to suggest that it's still 50:50. After all, how could the probability change just because I took another ball out afterwards? But the branch of probability and statistics known as Bayesian tells us that probabilities are not set in stone or absolute - they are only as good as the information we have, and gaining extra information can change the probability.

Initially you had no information about the balls other than that there were 50 of each colour in the pot. Now, however, you also know that a ball drawn from the remainder was white. If that first ball had been black, you would be slightly more likely to draw a white ball next time. So drawing a white makes it's slightly more likely that the first ball was black than it was white - you've got extra information. Not a lot of information, it's true. Yet it does shift the probability, even though the information comes in after the first ball was drawn.

If you find that hard to believe, imagine taking the example to the extreme. I've got a similar pot with just two balls in, one black, one white. I draw one out but don't look at it. What's the chance that this ball is black? Again it's 50%. Now lets take another ball out of the pot and look at. It's white. Do you still think that looking at another ball doesn't change the chances of the other ball being black? If so let's place a bet - because I now know that the other ball is definitely black.

So even though it appears that there's a 0.5 chance of the ball being black initially, what is really the case is that 0.5 is our best bet given the information we had. It's not an absolute fact, it's our best guess given what we know. In reality the ball was either definitely white or definitely black, not in some quantum indeterminate state. But we didn't know which it was, so that 0.5 gave us a best guess.

One final example to show how information can change apparently fixed probabilities.

We'll go back to the first example to show another way that information can change probability. Again I've got a pot, then with 50 black and 50 white balls. I draw one out. What's the probability it's black? You very reasonably say 50%.  So far this is exactly the same situation as the first time round.

I, however, have extra information. I now share that information with you - and you change your mind and say that the probability is 100% black, even though nothing has changed about the actual pot or ball drawn. Why? Because I have told you that all the balls at the bottom of the pot are white and all the balls at the top are black. My extra information changes the probabilities.

If the mind-boggling aspects of probability and randomness interest you, find out far more in Dice World.

Comments

  1. This stuff is fun, and I like your illustrations. Thanks for sharing!

    ReplyDelete

Post a Comment

Popular posts from this blog

Why I hate opera

If I'm honest, the title of this post is an exaggeration to make a point. I don't really hate opera. There are a couple of operas - notably Monteverdi's Incoranazione di Poppea and Purcell's Dido & Aeneas - that I quite like. But what I do find truly sickening is the reverence with which opera is treated, as if it were some particularly great art form. Nowhere was this more obvious than in ITV's recent gut-wrenchingly awful series Pop Star to Opera Star , where the likes of Alan Tichmarsh treated the real opera singers as if they were fragile pieces on Antiques Roadshow, and the music as if it were a gift of the gods. In my opinion - and I know not everyone agrees - opera is: Mediocre music Melodramatic plots Amateurishly hammy acting A forced and unpleasant singing style Ridiculously over-supported by public funds I won't even bother to go into any detail on the plots and the acting - this is just self-evident. But the other aspects need some ex

Is 5x3 the same as 3x5?

The Internet has gone mildly bonkers over a child in America who was marked down in a test because when asked to work out 5x3 by repeated addition he/she used 5+5+5 instead of 3+3+3+3+3. Those who support the teacher say that 5x3 means 'five lots of 3' where the complainants say that 'times' is commutative (reversible) so the distinction is meaningless as 5x3 and 3x5 are indistinguishable. It's certainly true that not all mathematical operations are commutative. I think we are all comfortable that 5-3 is not the same as 3-5.  However. This not true of multiplication (of numbers). And so if there is to be any distinction, it has to be in the use of English to interpret the 'x' sign. Unfortunately, even here there is no logical way of coming up with a definitive answer. I suspect most primary school teachers would expands 'times' as 'lots of' as mentioned above. So we get 5 x 3 as '5 lots of 3'. Unfortunately that only wor

Which idiot came up with percentage-based gradient signs

Rant warning: the contents of this post could sound like something produced by UKIP. I wish to make it clear that I do not in any way support or endorse that political party. In fact it gives me the creeps. Once upon a time, the signs for a steep hill on British roads displayed the gradient in a simple, easy-to-understand form. If the hill went up, say, one yard for every three yards forward it said '1 in 3'. Then some bureaucrat came along and decided that it would be a good idea to state the slope as a percentage. So now the sign for (say) a 1 in 10 slope says 10% (I think). That 'I think' is because the percentage-based slope is so unnatural. There are two ways we conventionally measure slopes. Either on X/Y coordiates (as in 1 in 4) or using degrees - say at a 15° angle. We don't measure them in percentages. It's easy to visualize a 1 in 3 slope, or a 30 degree angle. Much less obvious what a 33.333 recurring percent slope is. And what's a 100% slope