# Morbid statistics

briantrumpet
Posts:

**12,476**
Apologies for the morbid nature of this post (if you've been affected by a death of a friend or family cyclist I suggest you bypass this post).

Something that interests me is the sad but amazingly predictable number of cyclist deaths each year in the UK. Part of the reason this catches my eye is that, in fact, though the 100 or so deaths each year is a horrible waste of life, it is both remarkably low for the number of miles cycled in this country each year (about 3 billion), but also stays within quite a narrow margin too - a "20% increase" actually is (statistically) a tiny increase for the number of miles cycled and number of people on the roads. On average, two cyclists will be killed each week in the UK (compare that with about 40 motorists), and even with the cluster in London in the latter part of 2013, I think the annual totals will be roughly in line with the preceding years.

Now here's the thing - when each of those deaths is, well, if not random, the outcome of a unique set of circumstances (location, people, time, etc.), I can't wrap my head around how consistent the numbers are. Given the random nature of the events, I'd expect a much wider number of deaths - maybe 50 one year, 200 the next - but we are left with two per week, on average, for the past several years. What is 'causing' that consistency, in such widely spaced events?

Something that interests me is the sad but amazingly predictable number of cyclist deaths each year in the UK. Part of the reason this catches my eye is that, in fact, though the 100 or so deaths each year is a horrible waste of life, it is both remarkably low for the number of miles cycled in this country each year (about 3 billion), but also stays within quite a narrow margin too - a "20% increase" actually is (statistically) a tiny increase for the number of miles cycled and number of people on the roads. On average, two cyclists will be killed each week in the UK (compare that with about 40 motorists), and even with the cluster in London in the latter part of 2013, I think the annual totals will be roughly in line with the preceding years.

Now here's the thing - when each of those deaths is, well, if not random, the outcome of a unique set of circumstances (location, people, time, etc.), I can't wrap my head around how consistent the numbers are. Given the random nature of the events, I'd expect a much wider number of deaths - maybe 50 one year, 200 the next - but we are left with two per week, on average, for the past several years. What is 'causing' that consistency, in such widely spaced events?

0

## Posts

1,6745,273Yes I feel the key there is average.

You could have 6 one week, 15 in a day and nothing for a while - that is the problem with statistics (and averages).

The statistic is constant - I suspect the actual events are very random indeed.

3,060I think you're misunderstanding the mechanisms at play.

If the number did change from 50 one year to 200 the next it would be incredibly unlikely that there isn't a specific common factor influencing this.

Think of it this way. Lets say I flipped a coin 5 times and then did the same again. There would be nothing odd about the results if I got 4 heads/1 tail the 1st time and 2 heads/3 tails the 2nd time. Agreed? That would mean the occurrence of tails tripled between the two attempts on a completely random event.

Now what if I did 1000 coin flips each time. What's the likelihood of getting vastly more heads than tails the 1st time? It's pretty unlikely. The more times you do a 50/50 test the closer the result will tend to be to 50/50. So the two tests are far more likely to match up pretty closely than if you only have a few events each time.

You COULD still get significantly more heads or tails but that's very unlikely unless there's a cause. If there's a cause then it's not a random result.

3,0605,273No you have it the wrong way round. Statistics are only used to make/underpin a specific point, thus are only ever used in a way that the author chooses.

If you measure something for long enough you can come up with any number you like (you just select what criteria you want).

Probabilty is actually what is useful, not statistics (we have covered this on here before somewhere).

If you take those statistics they mean nothing without context - probabilty gives the context. E.G. that 2 per week could be really bad or really good, without knowing the total number of cyclist we have no idea.

8,4982 cyclist deaths per week is an ambiguous figure as we cannot gauge the causes of those deaths.

To make it more meaningful then we need to know

1) what type of environment those deaths occured in eg: road/offroad/city/town/rural

2) time of day

3) what the primary cause of death was ie if it was an accident then who was the "at fault" party. If it was ill-health then although it is a cyclist death, cycling wasn't the cause of the death.

I would think that if you took the raw data and applied a location stamp to it then most deaths occur in a city during a commute and the accident was with another vehicle - but then I would hazzard a guess that there are more cyclists commuting in cities than other areas.

2,518891But in order to study a trend over time, you naturally need stats collated against time. so looking at a single year performance compared with the previous year doesn't tell anyone anything.

But the previous post regarding the general road condition would be an interesting hypothesis to see if the statistics back it up. As if the year on year trend in localised areas are constant or it is trending up/down. and you can easily corrolate that to inititives taken by local authorities to see which scheme or road planning is the most/least sucessful and find out why it is so good/bad and apply it nationally if possible.

this won't take a team of statistians long to corrolate and fingure out and certainly won't cost the government huge amount of money. but i suspect it is the lack of interest in making roads better for cyclists in the whitehall that is stopping the death rate going down.

that said, the increase number of people using bikes are on the increase and therefore naturally there will be more accidents.

Road - Chinese Carbon Diablo - 6.4kg

3,060You say probablility is useful, not statistics. This makes no sense. What are you saying is the difference? Statistics is the set of tools used to calculate and work with probability based data. Probability can't really be of any use without statistics. Essentially if you ridicule one you ridicule the other.

5,273Statistics are history (dealing in past events) - they merely provide a numerical way of communicating what has already happened. There is often no context (as is the case with the OP's stats)

Probability deals with future events - It help us to predict how likely something is to happen based on known parameters. This is the bit that I am saying is missing in the statistics.

Simply quoting a statistic of 2 people die every week means nothing as we have no idea if that represents a high or low risk. People need actual useable data to enable them to make informed choices.

A statistic is a statement of fact (however contrived), nothing more.

3,060Statistics are not simply history. Probability is not the future. They are both mathematical terms not philosophical ideas. Probability and statistics are not two different things. Statistics is a mathematical discipline. Probability is a term dealt with extensively within the field of statistics.

If you studied statistics in school and/or college you'll probably remember that it was part of your maths curriculum. Not history, literary fiction, marketing or religious studies. It's a mathematical discipline. It's based on logical concepts and provable hypotheses. I'm not just saying that statistics are fact because it's my opinion and I'm arrogant. It's a fact. It's not an opinion.

Almost everyone who does engineering, experimental physics or a whole range of other subjects, especially in the sciences, will have had no choice but to do their fair share of statistics. Why?....because that's how you prove you're NOT just making stuff up. It's how you demonstrate correlation between theory and reality, or very often, it's how you discover your theory is flawed. Statistics can be used to analyse the past just as it can be used to predict the future. For complex problems it's often the only tool available. It's often the tool that reveals the truths we're not bright enough to recognise. It's exactly the opposite to what you seem to think it is. If some people doctor their results, misrepresent their data or just don't know what they're doing, that doesn't reflect on statistics. It reflects on them.

I don't particularly enjoy statistics and I'm not an expert but I know enough to understand it's a massively powerful tool. It's one of the pillars on which science, technology and indeed society is based..... yet you seem to think it's just a cynical a marketing tool.

Used well, statistical tools are probably the most accurate, revealing and objective we have.

I can't understand what point you're trying to make when you say that a statistic is a statement of fact and nothing more. A probability is a fact .... where are we going with this?

199I'm making the assumptions that (i) the number of deaths per year is an absolute, as opposed to proportional, value and (ii) that the number of people cycling is on the up.

13,123Almost every statistician I have ever met has cautioned against over reliance on statistical analysis.

I particularly like:

"Some people use statistics in the same way a drunk uses a lamp-post; more for support than illumination"

12,476But I did fail A-level maths. And I never did do statistics, because I was in the top set. Haha.

3,060This is the answer to the question you asked at the start of the thread.

5,273At last someone talking sense.

5,273OK you flip a coin 100 times and it lands on heads every time.

Statistically how many times has it landed on heads?

What is the probabilty it will land on heads the next flip?

3,060I presume you're trying to make a point but I'm at a loss to see what it is. The above in no way disagrees with anything I've said or invalidates anything. The analogy is valid. There is no mystery in the how or why of any of this. It's pretty basic maths. These aren't my ideas or opinions. It pretty common unremarkable knowledge. Look it up or ask the opinion of someone you'll believe. I'm just wasting my time at this point.

5,273Exactly.

Statistically it would be guarenteed to land on heads, but in reality it has a 50/50 chance.

So statistics and probabilty give 2 different answers not the same as you would have us believe.

1,5675,2731,567You have come out with some real guff, like 'statistics deals with history, probability with the future' and then told someone else trying to politely correct your misinformation with the truth that he was wrong. Stunningly wrongheaded.

5,273Which statistically may be true but in all probability you are still not correct. And the outcome of me being wrong is not even possible. There is still a chance though.

This has now ceased to be sufficent a challenge and is certainly adding nothing the the thread so I am out.

1,567Statistics -

The mathematics of the collection, organization, and interpretation of numerical data, especially the analysis of population characteristics by inference from sampling.

Probability -

The likelihood of the occurrence of an event. The probability of event A is written P(A). Probabilities are always numbers between 0 and 1, inclusive.

3,060You're coming up with spurious conclusions due to your own misunderstanding and then attributing the blame to an entire field of mathematics instead of listening to those trying to explain where you've gone wrong. It's astonishing.

To be clear. You're willing to believe that all mathematicians, scientists, engineers and actuaries amongst others are morons rather than consider the possibility that you should look up the meaning of the word "statistics".

I'm not sure if it's funny, sad or both.

I just hope no one is taking you seriously.

8,498Statistically speaking you'd be more likely to get another head though - probably a rigged coin either weighted one side or has 2 head sides. Or perhaps it was just the magnetic flux at that point in time. Eitherwhich way, the probability of throwing a tails has been reduced to almost zero with so many heads being thrown in a row.

328I'm no mathematician, statistician or probabilitist, but I do love wathching someone dig themselves deeper.

Keep going.

3,060In this analogy the assumption was that the coin flip was a 50/50. Under those circumstances the probability of 100 consecutive heads is so infinitesimally small that it is for all sensible purposes impossible.

So, I would agree with you that the 100 in a row result is so hugely statistically significant that you'd certainly be justified in saying the test is flawed and not actually random. I didn't want to go into this as I was having enough trouble trying to get across an explanation of much more basic concepts.

4,913If you go out and collect a load of data and present them, the cold, hard data (with no analysis) could be called the statistics.

so, 200 people die a year is a statistic. if, on average, 200 people died every year for 5 years and you were just looking at these statistics and someone asked you how many people will die next year, you'd probably say 200.

but statistics can also be the interpretation of the data as well, when something becomes statistically significant or using chi-squared, z-tests and all that jazz. so if someone asked you the say question and you used statistics (in this meaning) you would probably have a different answer!

neither person is wrong, they are just using the word statistics differently!

in my opinion anyway, i have a 90% confidence limit on this theory by the way!!

5,273Your definitions agree with my simple summary.

History. You can not collect something which does not exist (has not occurred).Future. If something has already happened it has no likelhood - it is absolute/difinitive/already occurred.I have never said that statistics are not used as part of prediction, nor have I claimed that they are unrelated to probabilty.