MATH 1040 - Introduction to Statistics

6.5 Boxplots

Resources

Reading

Lesson

Boxplots (or box-and-whisker plots)

A boxplot is a very simple graph intended to display the distribution of the quartiles. In order for the graph to be effective, we must start with the scale and the label:

  1. Create a numberline that starts below your minimum and goes beyond your maximum
  2. Label your numberline (what is the data that you are going to show)
  3. Draw lines above the numberline at the locations of
    • The minimum
    • Q1 (1st Quartile)
    • The median
    • Q3 (3rd Quartile)
    • The maximum
  4. Connect Q1 to Q3 to create a box (the median line will be inside the box, essentially dividing the box in two)
  5. Draw a single line from the minimum to Q1 and another from Q3 to the maximum (these are the whiskers)

Interquartile Range

The interquartile range (IQR) is the range between Q1 and Q3. In other words, it is the range of the box in our boxplot.

In short, IQR = Q3 - Q1.

Outliers

Sometimes, our boxplot gets really long whiskers - ridiculously long whiskers. This is because we may have a single point that is ridiculously extreme one way or the other.

There is a way to exclude these extreme points from the plot. Using the IQR, we can judge what a reasonable distance is to consider points valid or extreme. If points are within this distance, we keep them in the graph. Points beyond this reasonable distance are called outliers.

What is this reasonable distance? Any point within \(1.5\cdot IQR\) of the box is reasonable. Any points further than \(1.5\cdot IQR\) from the box is considered an outlier. In short,