Skip to main content

Beware the average

Which one's the average house?
I was struck by an item on the local news this morning saying that the average house price in the UK was £163,910 according to the Nationwide Building Society. This seemed a dubious statistic. Why? Because the average (or mean) is not a good measure of a distribution that isn't symmetrical. It's highly misleading. That's because the vast majority of houses in the UK are worth less than the average house price - and that is downright confusing.

Let's look at a simpler example to see what's going on. Imagine we have a room full of people and take their average earnings. Then we throw Bill Gates into the room. Bill's vast income would really bump up the average - so probably everyone else in the room would earn less than the average. The new average would not be representative of the room as a whole.

The reason a relatively small number of cases (in our room, Bill) can have a big impact is because the distribution - the spread of the incomes - is not symmetrical. Let's say the average income before Bill entered the room was £26,000 a year. Then the absolute maximum anyone can fall below that average is by £26,000. But there is no limit to how far above the average you can be. In Bill's case, he will be millions higher. So he has a much bigger impact on the average than a poor person does.

In such cases, the median is a very valuable number to know. This is just the middle value. We put all the people in a row in order of earnings and pick the middle number. With a distribution like our room - or house prices - the median gives us a much better feel for what a typical value is like than the average.

Which takes us back to the Nationwide. I took the liberty of dropping their Chief Economist, Robert Gardner an email and he was kind enough to call me back within 10 minutes (and to email through some bumf). You really wouldn't expect a financial institution to make such a basic statistical mistake... and they haven't. What the Nationwide repeatedly calls an average in their press releases isn't a simple average at all. Instead they stratify the data according to region, type of house and so forth and produce a rather messy weighted figure that could arguably be said to be the typical value - but it certainly isn't an average.

You can argue whether they should be rather clearer about just what the figure they are producing is, rather than calling it the average house price as they do, but at least it is a meaningful figure.

In other statistics, I'm afraid the press simply gets the words wrong. Quite often a government bureau will publish a median value and an average - they do so on earnings, for instance. What the media often does is to take the median value, because it's more meaningful, but calls it the average (presumably because they think the poor public can't cope with a hard word like 'median'). That's just bad journalism.

This distortion of the average is something that politicians wishing to attack another party and not being too scrupulous about their statistics can use to their advantage. If we want to tax those on high earnings and find the tax hits someone on the average wage, then there is an outcry, because that seems to imply that it hits the majority of ordinary people – but the majority actually earn less than the average wage. The naughty politician can play the numbers even more effectively by putting two people on an average wage into a household. Now we are not only using individuals that earn more than most, but a household where both partners do so. This pushes their collective income up so high that it puts the household in the top 25 per cent of all households, even though we are talking about two people who are on an average wage.

There's a simple message. Whenever you hear 'average' in statistics on the news or see them presented, it's worth taking the numbers with a pinch of salt unless you can verify just what lies behind that value.

Comments

Post a Comment

Popular posts from this blog

Why I hate opera

If I'm honest, the title of this post is an exaggeration to make a point. I don't really hate opera. There are a couple of operas - notably Monteverdi's Incoranazione di Poppea and Purcell's Dido & Aeneas - that I quite like. But what I do find truly sickening is the reverence with which opera is treated, as if it were some particularly great art form. Nowhere was this more obvious than in ITV's recent gut-wrenchingly awful series Pop Star to Opera Star , where the likes of Alan Tichmarsh treated the real opera singers as if they were fragile pieces on Antiques Roadshow, and the music as if it were a gift of the gods. In my opinion - and I know not everyone agrees - opera is: Mediocre music Melodramatic plots Amateurishly hammy acting A forced and unpleasant singing style Ridiculously over-supported by public funds I won't even bother to go into any detail on the plots and the acting - this is just self-evident. But the other aspects need some ex

Is 5x3 the same as 3x5?

The Internet has gone mildly bonkers over a child in America who was marked down in a test because when asked to work out 5x3 by repeated addition he/she used 5+5+5 instead of 3+3+3+3+3. Those who support the teacher say that 5x3 means 'five lots of 3' where the complainants say that 'times' is commutative (reversible) so the distinction is meaningless as 5x3 and 3x5 are indistinguishable. It's certainly true that not all mathematical operations are commutative. I think we are all comfortable that 5-3 is not the same as 3-5.  However. This not true of multiplication (of numbers). And so if there is to be any distinction, it has to be in the use of English to interpret the 'x' sign. Unfortunately, even here there is no logical way of coming up with a definitive answer. I suspect most primary school teachers would expands 'times' as 'lots of' as mentioned above. So we get 5 x 3 as '5 lots of 3'. Unfortunately that only wor

Which idiot came up with percentage-based gradient signs

Rant warning: the contents of this post could sound like something produced by UKIP. I wish to make it clear that I do not in any way support or endorse that political party. In fact it gives me the creeps. Once upon a time, the signs for a steep hill on British roads displayed the gradient in a simple, easy-to-understand form. If the hill went up, say, one yard for every three yards forward it said '1 in 3'. Then some bureaucrat came along and decided that it would be a good idea to state the slope as a percentage. So now the sign for (say) a 1 in 10 slope says 10% (I think). That 'I think' is because the percentage-based slope is so unnatural. There are two ways we conventionally measure slopes. Either on X/Y coordiates (as in 1 in 4) or using degrees - say at a 15° angle. We don't measure them in percentages. It's easy to visualize a 1 in 3 slope, or a 30 degree angle. Much less obvious what a 33.333 recurring percent slope is. And what's a 100% slope