5.1 False Positives and Negatives

How confident should we be about an apparent signal before we make decisions based upon it? What if we are wrong? These are questions faced by every medical patient, police officer, and president. The answer depends not only on the facts about the signal, but also on the perceived severity of the outcome in the event of an error—a signal falsely identified or inadvertently missed.

The Lesson in Context

This lesson gives students hands-on examples of false positives and negatives through a Jupyter notebook, in which they are asked to make moral judgments on the threshold for a positive test result based on the consequences of a false positive or negative.

Earlier Lessons

2.2 Systematic and Statistical Uncertainty

In an earlier lesson, students explored how statistical and systematic uncertainty can make data suggest false inferences, through either random variation within an insufficiently large sample (statistical uncertainty) or some systematic bias in the data (systematic uncertainty). Both of these sources of uncertainty can lead to either false positives or false negatives.

3.1 Probabilistic Reasoning

In this lesson, students will learn to use numeric credence levels to track and communicate their confidence levels about various claims and predictions. Credence levels can be a good way to make explicit the likelihood of false positives and false negatives in a given case, and one's tolerance for the two types of errors.

4.2 Finding Patterns in Random Noise

In the previous lesson, students explored how we often mistake noise for signal, because we are always looking for patterns through which to make sense of the world. This constant search for patterns can lead to false positives. On the other hand, sometimes there is so much noise that we miss real patterns, leading to false negatives.

Later Lessons

5.2 Scientific Optimism

"Scientific optimism" is a can-do attitude adopted by scientists by which they convince themselves that even difficult problems are solvable. With this "scientific optimism," scientists can successfully take on problems that take years or even decades to solve, with hundreds of steps and iterations involved in developing techniques, inventing technologies, collecting and analyzing data. Scientific optimism may also help scientists continue to detect false positives and false negatives in their previous work, iteratively improving our scientific understanding as we accumulate more evidence.

Takeaways

After this lesson, students should

Understand that there is always the inevitability of a trade-off—for a given test, one can reduce the risk of false positives by increasing the risk of false negatives, and vice versa. You make this trade-off when you determine what the threshold is (e.g. convict someone if you're 99% or 70% certain that they're guilty).
Understand that whether false positive or negative is preferable depends on a human values judgement on the consequences of either: (e.g. "innocent people shouldn't be in jail") and risks (e.g. dangerous people committing more crimes).
(Data science) Recognise that maximising the overall accuracy of a classification problem may sometimes be undesirable.
(Data science) Identify where a subtle classification problem may be hidden in a machine learning application.

False Positive/Type I Errors

A test yields a positive result, but in fact the condition is not present (e.g. your test says you have COVID, but you don't).

False Negative/Type II Errors

A test yields a negative result, but in fact the condition is present (e.g. your test says you don't have COVID, but you do).

Threshold

The cut-off quantity chosen for an indicator variable to decide when a given instance will be supposed positive vs. negative.

Unless the indicator variable is a perfect indicator of the underlying condition, there will be some probability of false positives and/or false negatives for any given threshold. Choosing a threshold is thus partly a matter of deciding one's relative tolerance for false positives and false negatives; a higher threshold will lead to more false negatives, a lower threshold to more false positives.

Incoming Asteroid

"The oncoming asteroid has only a 1% chance of hitting Earth. But if it does, life on Earth will be destroyed. It'll be expensive to stop the asteroid, but the risk is bad enough it's worth it."

Cancer Treatment

"I asked them to imagine that they faced a choice between two types of radiation therapy for early-stage breast cancer. The first treatment would leave them with a 15% chance of local recurrence and a 10% chance of moderate or severe breast fibrosis. The second treatment would leave them with only an 8% chance of local recurrence but a 30% chance of moderate or severe fibrosis. The radiation oncologists raised their hands in almost equal numbers for the two treatments. Some believed the higher risk of fibrosis was unacceptable, given the treatability of most local recurrences, whereas others believed the trauma of recurrence outweighed the discomfort of fibrosis. But sometimes physicians' values differ in important ways from those of many patients. When such value judgments are incorporated into professional treatment guidelines, without any explicit acknowledgment that a reasonable patient might choose an alternative course of treatment, they take potential choices away from patients." (Source)

There have been many people who still caught the virus even after a "negative" test result. These tests are just unreliable.

Most medical tests will inevitably produce false positive or false negative results at a certain rate. Although no conclusion can be totally accurate, it is possible to quantify the rate of such errors, so that the test result can be used to assess the probabilistic risks and odds of having a disease. See 3.1 Probabilistic Reasoning.

Additional Content

You must be logged in to see this content.