|
|
As an example, suppose two people, Ann and Bob, who are let loose on Wikipedia. In the first test, Ann improves 60 percent of the articles she edits while Bob improves 90 percent of the articles he edits. In the second test, Ann improves just 10 percent of the articles she edits, while Bob improves 30 percent.
Both times, Bob improved a much higher percentage of articles than Ann - yet when the two tests are combined, Ann has improved a much higher percentage than Bob!
The result comes about this way: In the first test, Ann edits 100 articles, improving 60 of them, while Bob edits just 10 articles, improving 9 of them. In the second test, Ann edits only 10 articles, improving 1 of them, while Bob edits 100 articles, improving 30 of them. When the two tests are added together, both edited 110 articles, yet Ann improved 61 of them (55 percent) while Bob improved only 39 of them (35 percent)!
It appears that the two sets of data separately support a certain hypothesis, but, considered together, support the opposite hypothesis.
To recap, introducing some notation that will be useful later:
SA = 100/110SA(1) + 10/110SA(2).
SB = 10/110SB(1) + 100/110SB(2).
By more extreme reweighting A's overall score can be pushed up to 60% and B's down to 30%.
The arithmetic allows us to see through the paradox but there is still the conflict between the individual performances and the overall performance: who is better, A or B? Ann and Bob's creator thought Ann was better--her overall success rate is higher. But it is possible to retell the story so that it appears obvious that B is better. A and B are now hospitals and the two tests have become two types of patient: mild and severe. The numerical data is as before: B is better at curing both types of patient but its overall success rate is worse because almost all (100/110) of its patients are severe cases while almost all of A's are mild (100/110). The association of success with A is misleading, even spurious.
In this retelling has something been added, or has a tacit assumption of the Ann and Bob story been changed? These issues are discussed in the modern literature on Simpson's paradox. Although statisticians have known about the Simpson's paradox phenomenon for over a century, there has lately been a revival of interest in it and philosophers, computer scientists, epidemiologists, economists and others have discussed it too.