Analyze star ratings with simple average, Bayesian average, Wilson confidence, distribution visualization, polarity detection, and entropy-based consensus metrics.
Five-star rating systems power decisions on Amazon, Yelp, Google, the App Store, and countless other platforms — but a simple average can be deeply misleading. An item with one 5-star review isn't better than one with 1,000 reviews averaging 4.7 stars, even though its simple average is higher. This calculator goes far beyond the crude average to provide statistically rigorous rating analysis.
Three ranking methods are computed: the simple weighted average, the Bayesian average (IMDB-style, which pulls ratings toward a prior when review counts are low), and the Wilson lower bound (which gives a confidence-adjusted "worst reasonable case" score for ranking). Beyond numerical scores, the calculator measures rating consensus through standard deviation and entropy, detects polarized distributions, and computes a net sentiment score.
Whether you're evaluating products, ranking search results, comparing restaurants, or designing your own rating system, this calculator shows you what the star distribution actually reveals — and what a simple "4.2 out of 5" hides.
Every ecommerce platform, review site, and marketplace needs to rank items by ratings — and simple averages fail in predictable ways. This calculator demonstrates three industry-standard solutions (simple, Bayesian, Wilson) side by side, so platform designers can choose the right method and users can understand why ratings feel "off" sometimes.
The distribution visualization, polarity detection, and entropy metrics provide insights that no single number can capture. A "3.5-star" product could be mediocre (most ratings 3-4), controversial (split between 1 and 5), or barely-reviewed (one 3 and one 4). This calculator tells you which.
Simple Average: Σ(star × count) / Σ(count)

Bayesian Average: (m × C + Σ(star × count)) / (m + Σ(count)), where m = prior review count and C = prior mean (typically 3.0)

Wilson Lower Bound (on the proportion positive): (p̂ + z²/2n − z√(p̂(1−p̂)/n + z²/4n²)) / (1 + z²/n), where p̂ = proportion of 4-5★ ratings, n = total ratings, and z = 1.96 for a 95% CI
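As a sketch, the three formulas above translate directly into code. The distribution used below is a hypothetical 100-rating example chosen for illustration, not necessarily the one behind the worked result that follows:

```python
import math

def simple_average(counts):
    """counts maps star level (1-5) to number of ratings."""
    n = sum(counts.values())
    return sum(star * c for star, c in counts.items()) / n

def bayesian_average(counts, prior_mean=3.0, prior_count=100):
    """Blend observed ratings with a prior of prior_count reviews at prior_mean."""
    n = sum(counts.values())
    stars = sum(star * c for star, c in counts.items())
    return (prior_count * prior_mean + stars) / (prior_count + n)

def wilson_lower_bound(positive, n, z=1.96):
    """95% lower confidence bound on the true positive (4-5 star) proportion."""
    if n == 0:
        return 0.0
    p = positive / n
    centre = p + z * z / (2 * n)
    margin = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n))
    return (centre - margin) / (1 + z * z / n)

counts = {5: 45, 4: 30, 3: 15, 2: 5, 1: 5}  # hypothetical distribution
simple = simple_average(counts)              # 4.05
bayes = bayesian_average(counts)             # 3.525
wilson = wilson_lower_bound(counts[5] + counts[4], sum(counts.values()))
```

Note how the same 4.05 simple average yields a noticeably lower Bayesian average and Wilson bound once sample size is taken into account.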
Result: Simple: 4.05/5, Bayesian: 3.57/5, Wilson: 67.7%, SD: 1.10, Net: +40%
With 100 total ratings weighted toward 5 and 4 stars, the simple average is 4.05. The Bayesian average (with a 100-review prior at 3.0) pulls this down to 3.57, reflecting that 100 reviews provide only moderate evidence against the prior. The Wilson lower bound of 67.7% means we can be 95% confident that the true proportion of positive (4-5★) ratings is at least 67.7%. An SD of 1.10 indicates moderate consensus.
IMDB uses a Bayesian average ("weighted rating") for its Top 250 list: WR = (v/(v+m)) × R + (m/(v+m)) × C, where v = votes, m ≈ 25,000, R = mean rating, C = mean across all films (~7.0). Amazon uses a proprietary system that factors in recency, verified purchases, and helpfulness votes alongside star counts. Reddit's "Best" comment sort uses Wilson confidence intervals, as described in Evan Miller's influential blog post "How Not to Sort by Average Rating."
Online ratings typically follow a J-shaped distribution: many 5-star ratings, gradually fewer 4, 3, 2, and then a bump at 1 star. This happens because satisfied customers leave reviews voluntarily (5★), dissatisfied customers complain (1★), but average-experience customers rarely bother. Any rating system must account for this selection bias.
When designing a rating system, consider: (1) Bayesian averaging to handle cold starts, (2) recency weighting to reflect improving or declining quality, (3) credibility signals that weight verified purchasers more heavily, (4) distribution bars displayed alongside the headline number, and (5) a minimum review volume before ratings are shown publicly. Each design choice affects how users interpret and trust the system.
The Bayesian average blends your data with a prior assumption (default: 3.0 stars with 100 reviews' weight). For items with few reviews, the result is pulled toward the prior. As reviews accumulate, the data overwhelms the prior and the Bayesian average converges to the simple average. This prevents a single 5-star review from ranking above a well-reviewed 4.5-star item.
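A quick sketch of that convergence, using the default prior (3.0 stars with 100 reviews' weight) and hypothetical item numbers:

```python
def bayesian_average(total_stars, n, prior_mean=3.0, prior_count=100):
    # Blend the observed star total with prior_count phantom reviews at prior_mean.
    return (prior_count * prior_mean + total_stars) / (prior_count + n)

one_five_star = bayesian_average(5, 1)            # (300 + 5) / 101, roughly 3.02
well_reviewed = bayesian_average(4.5 * 500, 500)  # (300 + 2250) / 600 = 4.25
# The single 5-star review barely moves the prior, so the well-reviewed
# 4.5-star item ranks far above it.
```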
Wilson lower bound is ideal for ranking items by approval rate. It answers: "Given this sample size, what's the lowest percentage of positive ratings we can be 95% confident about?" A product with 10/10 positive ratings gets a lower Wilson score than one with 95/100, because the second has more evidence. Reddit uses a variant of this for comment ranking.
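The 10/10 versus 95/100 comparison can be checked directly with the Wilson formula given earlier (z = 1.96 for a 95% confidence level):

```python
import math

def wilson_lower_bound(positive, n, z=1.96):
    """95% lower confidence bound on the true positive proportion."""
    if n == 0:
        return 0.0
    p = positive / n
    centre = p + z * z / (2 * n)
    margin = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n))
    return (centre - margin) / (1 + z * z / n)

perfect_but_small = wilson_lower_bound(10, 10)   # ~0.72: too little evidence
strong_and_large = wilson_lower_bound(95, 100)   # ~0.89: more evidence wins
```

Even a perfect 100% approval rate over 10 ratings ranks below 95% approval over 100 ratings, which is exactly the behavior you want when sorting by "worst reasonable case."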
Entropy measures the spread of ratings across star levels. Low entropy means ratings cluster at one level (strong consensus — good or bad). High entropy means ratings are spread evenly (no consensus, controversial item). Maximum entropy occurs when each star level has exactly 20% of ratings.
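A minimal sketch of Shannon entropy over the star distribution (the distributions below are made-up examples of strong consensus and maximum spread):

```python
import math

def rating_entropy(counts):
    """Shannon entropy (in bits) of the star distribution; max is log2(5)."""
    n = sum(counts.values())
    return -sum((c / n) * math.log2(c / n) for c in counts.values() if c > 0)

consensus = rating_entropy({5: 90, 4: 5, 3: 3, 2: 1, 1: 1})        # ~0.64 bits
no_consensus = rating_entropy({5: 20, 4: 20, 3: 20, 2: 20, 1: 20})  # log2(5) ~ 2.32
```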
Polarity measures how bimodal the distribution is — how much of the ratings are at the extremes (1★ and 5★) versus the middle (2-4★). A highly polarized product has fans who love it and critics who hate it. The average might be 3 stars, but the experience is nothing like "average" — it depends on who you are.
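The calculator's exact polarity formula isn't specified here, but a simple version is the share of ratings sitting at the extremes, as a sketch:

```python
def polarity(counts):
    """Share of ratings at the extremes (1 and 5 stars) vs the middle (2-4)."""
    n = sum(counts.values())
    return (counts.get(1, 0) + counts.get(5, 0)) / n

controversial = polarity({5: 50, 4: 0, 3: 0, 2: 0, 1: 50})  # 1.0, yet mean is 3.0
mediocre = polarity({5: 0, 4: 0, 3: 50, 2: 50, 1: 0})       # 0.0, mean is 2.5
```

Both items have a middling average, but only the first is polarized; this is the distinction the metric surfaces.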
Set the prior review count to the typical number of reviews for items in your category. If most items have ~200 reviews, use 200 as the prior. This ensures new items with few reviews aren't artificially inflated. IMDB uses approximately 25,000 as the prior for its Top 250 list.
If ratings are skewed (e.g., mostly 5-star with some 1-star), the mean is pulled down by the low ratings while the median stays at 5. The median represents the "typical" review; the mean reflects the overall balance. A large gap between the two indicates a skewed distribution.
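A small example of that gap, using a made-up skewed sample:

```python
import statistics

ratings = [5] * 80 + [1] * 20  # hypothetical: mostly 5-star with a 1-star block
mean_rating = statistics.mean(ratings)      # 4.2: pulled down by the 1-stars
median_rating = statistics.median(ratings)  # 5: the "typical" review
```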