Box Plot Calculator

Generate box-and-whisker plots with five-number summary, Tukey fences, outlier detection, percentile table, and Bowley skewness. Interactive SVG visualization.

About the Box Plot Calculator

The box plot calculator creates a complete box-and-whisker diagram from your data with interactive outlier detection and percentile analysis. Box plots are one of the most informative data visualizations in statistics — they show the median, spread, skewness, and outliers in a single compact chart.

This tool computes the five-number summary (min, Q1, median, Q3, max), calculates inner and outer Tukey fences, identifies mild and extreme outliers, and renders an SVG box plot. You can choose between inclusive and exclusive quartile methods and customize the fence multiplier.

Paste your data, select options, and get a publication-ready box plot with complete statistical details — including Bowley skewness, percentile breakdown, and color-coded outlier table. Check the example with realistic values before reporting. Use the steps shown to verify rounding and units. Cross-check this output using a known reference case. Use the example pattern when troubleshooting unexpected results. Validate that outputs match your chosen standards.

Why Use This Box Plot Calculator?

Box plots are the single best tool for quickly summarizing data distribution. This calculator gives you the complete picture: five-number summary, Tukey fences, outlier identification, skewness measurement, and a ready-to-use SVG visualization.

Whether you're comparing groups, screening for outliers, or presenting data summaries, the box plot calculator delivers every metric you need alongside the visual representation.

How to Use This Calculator

  1. Enter your data as comma-separated or space-separated numbers (minimum 5 values).
  2. Select the quartile method: inclusive (median in both halves) or exclusive (median excluded).
  3. Adjust the fence multiplier (default 1.5 for standard Tukey fences, use 3 for extreme outlier detection only).
  4. Toggle outlier display to show or hide outlier circles on the box plot.
  5. Read the five-number summary and fence values from the detail table.
  6. Check the outlier table for flagged values with their severity and fence distance.
  7. Review the percentile table for a complete positional breakdown.

Formula

IQR = Q3 − Q1. Lower fence = Q1 − k × IQR. Upper fence = Q3 + k × IQR. Standard: k = 1.5 (mild), k = 3 (extreme). Bowley skewness = (Q3 + Q1 − 2 × Median) / IQR.

Example Calculation

Result: Min=68, Q1=73, Median=82.5, Q3=89, Max=95, IQR=16, No outliers

The 12 test scores have a median of 82.5 with an IQR of 16. All values fall within the fences (Q1 − 1.5×IQR = 49, Q3 + 1.5×IQR = 113), so there are no outliers. The box plot shows a slight left skew.

Tips & Best Practices

Anatomy of a Box Plot

The box plot was invented by John Tukey in 1970 as part of exploratory data analysis (EDA). The "box" spans Q1 to Q3, capturing the middle 50% (the interquartile range). The median line inside the box shows central tendency. Whiskers extend to the most extreme values within the fences, and outliers appear as individual points beyond the whiskers.

Interpreting Box Plot Shapes

A symmetric box plot has the median centered in the box with equal-length whiskers. A right-skewed plot has the median closer to Q1, with a longer right whisker and upper outliers. A left-skewed plot is the mirror image. The relative lengths of whiskers and position of the median tell you about data asymmetry at a glance.

Box Plots vs. Violin Plots

Modern alternatives like violin plots combine box plots with kernel density estimates, showing the full distribution shape. However, box plots remain preferred for their simplicity and interpretability, especially when comparing many groups. They clearly show outliers, which violin plots often obscure.

Frequently Asked Questions

What does a box plot show?

A box plot shows the five-number summary: minimum, Q1 (25th percentile), median (50th percentile), Q3 (75th percentile), and maximum. The box spans Q1 to Q3 (the IQR, containing the middle 50% of data). Whiskers extend to the most extreme non-outlier values. Outliers are shown as individual points.

How are outliers determined in a box plot?

Values beyond the "fences" are outliers. The standard (Tukey) fences are Q1 − 1.5×IQR and Q3 + 1.5×IQR. Values beyond these are "mild outliers." Values beyond Q1 − 3×IQR or Q3 + 3×IQR are "extreme outliers." These thresholds were chosen by John Tukey because they work well for normally distributed data.

What's the difference between inclusive and exclusive quartile methods?

The inclusive method includes the median in both the lower and upper halves of the data (used in many textbooks). The exclusive method excludes the median from both halves. Results differ most for odd-sized datasets. The difference is usually small, but be consistent in your analysis.

When should I use a box plot instead of a histogram?

Box plots are better for comparing multiple groups, showing outliers, and displaying data compactly. Histograms are better for showing the detailed shape of a single distribution (bimodality, gaps). Use box plots for comparison and histograms for detailed distribution analysis.

What does Bowley skewness measure?

Bowley skewness (quartile skewness) measures symmetry using quartiles: (Q3 + Q1 − 2×Median) / IQR. It ranges from −1 to +1. Zero means perfectly symmetric quartiles. Unlike Pearson's skewness, it's robust to outliers since it only uses quartiles.

How many data points do I need for a meaningful box plot?

You need at least 5 values to compute the five-number summary. However, box plots become more meaningful with 20+ data points. With fewer than 10 values, individual data displays (dot plots, stem-and-leaf plots) may be more informative.

Related Pages