This lesson teaches students to be cautious of bad science of all kinds and aware of the signs thereof. We ask students to study a few established examples of pathological science in history dressed as science merely in form. The emphasis is on well-intentioned researchers "falling in love" with their own ideas, finding excuses to justify them even when reality has turned out to be contrary to them.
Although blatant pseudoscience such as flat earth, astrology, creationism, and alternative medicine are also included, what deserves particular attention is seemingly genuine science done by seemingly genuine people that is nevertheless incorrect, possibly due to the researchers' own hubris.
Everyone in principle has access to the same shared reality. If a certain (amazing) result by one research group cannot seem to be replicated by many other groups, it is a good sign that it does not accurately describe the shared reality.
It is expected that patterns arise from random noise. Good science can also turn out to be wrong just by chance, but stubbornly sticking with the original result would turn it into bad science.
In the spirit of personal persistence and iterative progress, setbacks such as a wrong or disappointing result should be accepted as an inevitable part of this progress.
Though not yet widely employed in all fields of science, various blind analysis techniques can help reduce the possibility that a scientific result may be contaminated by subtle analysis choices made by the researchers that are (often subconsciously) motivated by the desire for a certain anticipated result.
Bad science is particularly problematic when it concerns the study of human subgroups, as it may be motivated by or may perpetuate preexisting inequitable power structures in society.
Takeaways
After this lesson, students should
Be able to distinguish roughly among the following:
Good science (that gets the wrong or right answer).
Fraudulent science.
Pathological science.
Poorly-done science.
Pseudo-science.
Feel comfortable using Langmuir's Pathological Science Indicators to assess scientific studies.
Be able to identify what is wrong in cases of fraudulent, pathological, poorly-done, and pseudo-science.
Spectrum of Poor Research
There are many gradations to poor research and poor results. Several approximate categories are listed below.
Good Science
This includes good well-done science that gets the wrong result. Even with a confidence interval of 95%, 5% of the time the results will be wrong. This is normal.
Poorly-done Science
Science done honestly, but not done well.
Pathological Science
This is what happens when a researcher falls a little too "in love" with their ideas. It is indicated by Langmuir's criteria.
Pseudo-science
Pseudo-science is characterized by using scientific vocabulary without aligning with the corresponding concepts or engaging in real scientific practices (i.e., science being "skin deep," not scientific below the surface).
Fraudulent Science
Research that involves intentional deception, such as deliberately fabricating data or deliberately deceiving the reader about the strength of evidence.
Langmuir's Pathological Science Indicators
A set of six indicators that can be used to flag when something might be pathological science.
The effect is produced by a barely detectable cause, and the magnitude of the effect is substantially independent of the intensity of the cause.
The effect is barely detectable, or has very low statistical significance.
Claims of great accuracy.
Involving fantastic theories contrary to experience.
Criticisms are met with ad hoc excuses.
Ratio of supporters to critics rises to near 1:1, then drops back to near zero.
There is no set number of Langmuir's criteria that determines whether something is pathological science or not. The criteria serve as a useful guide, but detecting pathological science is ultimately something of an art. That being said, whether or not a given study is a case of pathological science depends on the attitude of the researchers. Indicators 3 and 5 speak to this most directly.
This paper's result could not be replicated by others in the field, therefore the authors have committed pathological science.
There may be a genuine difference in the way the study is conducted between these papers, contributing to a difference in result. The original study may have reached the wrong conclusion from correct analysis simply due to the random noise in the data. The original study may have made an error in their analysis that has not been discovered (for example, superluminal neutrinos).
Each student will have read one of the three studies that we discuss in this section, as well as an study by Langmuir. Assign each group to one of the three studies one week ahead of time, and then send another reminder to read them one or two days before the lesson. Also make sure to print the handout.
During Class
5 Minutes
Introduce the lesson and go over the plan for the day. Make sure people have groups, spokespeople, etc.
Students are assigned one of three scientific studies to skim through before class and asked to assess the validity of its methodology, with reference to Langmuir's indicators for pathological science. They may follow the handout.
Students review their studies in small groups and answer the questions in the handout. A note taker should be responsible for writing down ideas from the group.
20 Minutes
Since each group has been assigned one of three studies, we now bring three groups with different studies together into larger pods of groups. Each group's spokesperson will spend 3-5 min presenting their group's study to the other groups using their answers to the small-group questions on the handout as a guide.
25 Minutes
Call the whole class together and go over the final discussion questions. Try to come up with a class consensus for each study.
Small-group Questions
These are the questions on the handout that small groups will use to guide their discussion. Only after taking notes that answer these will they go over the final discussion questions. These questions are the same for each group no matter what study they were assigned.
What was the research question and hypothesis?
What variables were tested?
Do you have a sense that there were other confounding factors that were controlled for?
How did the scientific community respond? How did the scientists who conducted the research respond to the scientific community?
What did the scientists conclude? What did the scientific community conclude?
Final Discussion Questions
The poor science scale (or spectrum).
These questions are used for each study. They are initially discussed in small groups and then answered collectively by the entire class. The third question is intended mainly for small groups so as to serve as fodder for the larger discussion.
Where does this study fall on the poor science scale?
Which of Langmuir's criteria (if any) apply to this study?
Where, if anywhere, did this science go wrong?
Study A: Arsenic Bacterium
This paper discussed a novel substitution of arsenic for phosphorus in bacterial DNA, which would have profound implications on requirements for life in any form. They reported the discovery of an unusual microbe that could vary elemental composition of its basic biomolecules by substituting arsenic for phosphorus. Mechanisms for such substitution were unknown. Many criticisms were published as technical comments in Science and online, such as accusing the original team of failing to "meticulously clean up the DNA first". An Israeli team worked out the mechanism by which the bacterium in question discriminated between nearly identical molecules of phosphate and arsenate. Science also published an official statement saying some of the findings in the original paper were incorrect in 2012.
Where does this study fall on the poor science scale?
Poorly-done science. Wrong results.
Which of Langmuir's criteria (if any) apply to this study?
The effect is produced by a barely detectable cause, and the magnitude of the effect is substantially independent of the intensity of the cause.
The effect is barely detectable, or has very low statistical significance.
Claims of great accuracy.
Involving fantastic theories contrary to experience.
Criticisms are met with ad hoc excuses.
Ratio of supporters to critics rises to near 1:1, then drops back to near zero.
Where, if anywhere, did this science go wrong?
The experiment was not done carefully enough.
Study B: Water Memory
This paper provided evidence for the theory that water retains memory of previously dissolved substances, which has been claimed to be a mechanism by which homeopathic remedies work, but runs contradictory to our current understanding of chemistry and physics. However, since the paper had no apparent methodological problems, it was published in Nature accompanied with an editorial that noted "There are good and particular reasons why prudent people should, for the time being, suspend judgment" and described some of the fundamental laws of chemistry and physics which it would violate, if shown to be true. Additionally, the editor of Nature demanded that the experiments be re-run under supervision. In the supervised double-blind experiments, no memory effect was observed. Nature published a follow-up report of these findings, and their editor said: "We believe the laboratory has fostered and then cherished a delusion about the interpretation of its data." and pointed out that two of the researchers were being paid for by a homeopathic company. The PI also published a follow-up letter where he compared his treatment by the Nature team to "Salem witchhunts or McCarthy-like prosecutions". More drama ensued, and the results of the paper are still used to claim that the experiments "prove" that homeopathy works.
Where does this study fall on the poor science scale?
Pathological science.
Which of Langmuir's criteria (if any) apply to this study?
The effect is produced by a barely detectable cause, and the magnitude of the effect is substantially independent of the intensity of the cause.
The effect is barely detectable, or has very low statistical significance.
Claims of great accuracy.
Involving fantastic theories contrary to experience.
Criticisms are met with ad hoc excuses.
Ratio of supporters to critics rises to near 1:1, then drops back to near zero.
Where, if anywhere, did this science go wrong?
The researchers were very attached to their scientific findings and criticized the supervised experiments.
Study C: Superluminal Neutrinos
Neutrinos are subatomic pstudies with extremely small masses, which means that they should move at very nearly (but slightly slower than) the speed of light. In this paper, neutrinos created at CERN were detected at another site in Italy to measure the precise arrival time and distance, and therefore the speed at which neutrinos travel. They found that neutrinos traveled slightly faster than the speed of light with high significance (over 15,000 separate neutrino events). If this is correct, it would overturn a cornerstone of modern physics. The team published their preliminary data to the public to enlist the help of other scientists for searching for an explanation, and doubled down on their efforts to find an explanation for this result. The team's in-depth investigation of their experiment turned up a few errors, which when corrected found that the speed was consistent with the speed of light. Other independent experiments also found that neutrinos traveled at the speed of light, but showed no evidence for faster than the speed of light.
Where does this study fall on the poor science scale?
Good scientific practice. Wrong results.
Which of Langmuir's criteria (if any) apply to this study?
The effect is produced by a barely detectable cause, and the magnitude of the effect is substantially independent of the intensity of the cause.
The effect is barely detectable, or has very low statistical significance.
Claims of great accuracy.
Involving fantastic theories contrary to experience.
Criticisms are met with ad hoc excuses.
Ratio of supporters to critics rises to near 1:1, then drops back to near zero.
Where, if anywhere, did this science go wrong?
Even very carefully done science may inevitably contain errors. The team recognized the implications of the results and openly invited other scientists to critique their procedure, leading to the discovery and correction of the relevant errors.
This example illustrates that a couple of Langmuir's criteria in-and-of themselves do not mean that a certain study is a case of pathological science. If the researchers fully acknowledge the implications and limitations of their results and do everything in their power to prove themselves wrong then it is likely still an example of good scientific practice.