Complete guide
Use the calculator above to paste your data and instantly visualise the five-number summary, inter-quartile range, whiskers and outliers on a real box plot, with plain-English interpretation of distribution shape. The box plot is the fastest way to see whether data is symmetric, skewed or contaminated by outliers.
What it is
What is box plot?
A box plot (or box-and-whisker plot) is a graphical summary of a data set using the median, the lower and upper quartiles, and whiskers that extend to the most extreme non-outlier values. Outliers are plotted as individual points. It is one of the most information-dense visualisations in statistics.
Calculation logic
How the calculation works
The five-number summary is: minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), maximum. The box spans Q1 to Q3 (the inter-quartile range, IQR). Whiskers extend to the most extreme values within 1.5 × IQR from the box. Anything beyond that is plotted as an outlier.
Common mistakes
Watch-outs before using box plot
- Reporting only the mean — box plots reveal the distribution behind the average.
- Confusing outliers with errors — outliers might be the most informative data points, not the ones to discard.
- Comparing two box plots informally instead of using a proper hypothesis test when the difference matters.
- Using box plots on very small samples (n < 10) — quartiles become unstable.
- Ignoring skew — non-symmetric box plots invalidate many tests that assume normality.
What to do next
Turn the result into action
When a box plot shows skew, transform the data or use non-parametric tests. When it shows outliers, investigate them — they are often the source of process knowledge, not just nuisance. When it shows wide IQR, run a variation-reduction project.
What is a box plot?
A graphical summary of a data set showing the minimum, lower quartile, median, upper quartile and maximum, with outliers plotted as individual points.
What is the inter-quartile range?
The IQR is the distance from Q1 (25th percentile) to Q3 (75th percentile) — the box itself. It captures the middle 50% of the data and is robust to outliers.
How are outliers defined on a box plot?
Conventionally, any point further than 1.5 × IQR from the nearest end of the box. Some teams use 3 × IQR for "extreme outliers".
When should I use a box plot instead of a histogram?
Use a box plot for compact group comparisons (multiple distributions side-by-side). Use a histogram when shape detail matters more than comparison.
Can box plots be used on non-normal data?
Yes — they make no normality assumption, which is one of their main strengths. They are particularly useful for detecting non-normality before further analysis.