Droplets Forming: Ninthers, Tukey and statistical approximation

Wednesday, June 24, 2009

A response to a post on The Endeavour about Ninthers and Tukey:

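There are two useful things in the ninther concept: approximating the statistics of a bulk dataset from a small sample, and estimating a median without any sorting. A ninther is the median of the medians of three groups of three values, so it needs only a small, fixed number of comparisons, and the result is an approximation to the true median of the nine values rather than a guarantee of it.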
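As a rough illustration of the definition above, here is a minimal Python sketch of a ninther. The function names `median_of_three` and `ninther` are my own, not taken from the original post or from any library.

```python
def median_of_three(a, b, c):
    """Return the middle of three values using at most three comparisons."""
    if a > b:
        a, b = b, a          # now a <= b
    if b > c:
        b = c                # the largest value is discarded
        if a > b:
            b = a            # the middle value is max(a, c)
    return b


def ninther(values):
    """Median of the medians of three consecutive groups of three values."""
    if len(values) != 9:
        raise ValueError("a ninther needs exactly 9 values")
    v = values
    return median_of_three(
        median_of_three(v[0], v[1], v[2]),
        median_of_three(v[3], v[4], v[5]),
        median_of_three(v[6], v[7], v[8]),
    )


print(ninther([3, 1, 4, 1, 5, 9, 2, 6, 5]))  # prints 5
```

The sorted values here are 1, 1, 2, 3, 4, 5, 5, 6, 9, so the true median is 4 while the ninther gives 5: close, but an estimate. Each call uses at most 12 comparisons (four medians of three).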

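Generalising from 3 groups of 3 to n groups of m, we could still estimate a median from a series of chunks of the dataset, but we would need to sort: once the groups grow beyond a few values, finding each group's median by hand-written comparisons is no longer practical.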
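A sketch of that generalisation, assuming the data is simply cut into consecutive groups of m values (with a shorter final group if the length is not a multiple of m). The name `median_of_group_medians` is hypothetical, and `statistics.median` from the standard library sorts its input internally, which is exactly the cost mentioned above.

```python
from statistics import median


def median_of_group_medians(values, m):
    """Estimate the median as the median of the medians of groups of size m."""
    if m < 1:
        raise ValueError("group size m must be at least 1")
    # Consecutive groups of m values; the last group may be shorter.
    groups = [values[i:i + m] for i in range(0, len(values), m)]
    # statistics.median sorts each group, and then the list of group medians.
    return median(median(group) for group in groups)


print(median_of_group_medians([3, 1, 4, 1, 5, 9, 2, 6, 5], 3))  # prints 5
```

With 3 groups of 3 this agrees with the ninther on the same nine values.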

Sorting would suggest problems when working on Very Large Datasets, but consider the case of the most annoying dataset: randomised data. If we take a chunk of data from this dataset, we can approximate the statistics of the bulk with the statistics of the chunk, or of a series of chunks.

The answer may be, then, to randomly pick out a series of 9-value chunks and calculate a ninther for each one. Because only the sampled values are ever compared, the number of comparisons divided by the total number of values in the dataset can be less than 1.

The cost therefore ranges from O(1) to O(N), depending on how accurate you require your statistics to be.
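The sampling idea can be sketched as follows. This is only one possible reading of it: the function name `sampled_ninther_estimate`, the `num_chunks` parameter, and the choice to combine the ninthers by taking their median are all my assumptions, not something the original post specifies.

```python
import random
from statistics import median


def median_of_three(a, b, c):
    """Return the middle of three values using at most three comparisons."""
    if a > b:
        a, b = b, a
    if b > c:
        b = c
        if a > b:
            b = a
    return b


def ninther(v):
    """Median of the medians of three groups of three values."""
    return median_of_three(
        median_of_three(v[0], v[1], v[2]),
        median_of_three(v[3], v[4], v[5]),
        median_of_three(v[6], v[7], v[8]),
    )


def sampled_ninther_estimate(data, num_chunks, rng=None):
    """Estimate the median of data from num_chunks random 9-value chunks."""
    rng = rng or random.Random()
    # Each chunk is 9 values drawn without replacement from the whole dataset.
    ninthers = [ninther(rng.sample(data, 9)) for _ in range(num_chunks)]
    # Assumption: combine the per-chunk ninthers by taking their median.
    return median(ninthers)


data = list(range(1_000_000))
print(sampled_ninther_estimate(data, num_chunks=100))
```

With 100 chunks the ninthers cost at most 1,200 comparisons for a million values, well under one comparison per value. Combining the ninthers adds a little more work on top of that, but it depends only on the number of chunks, not on N. Raising `num_chunks` buys accuracy at the price of more comparisons, which is the O(1) to O(N) trade-off described above.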
