Chi-Square Test Calculator

Perform chi-square goodness-of-fit and independence tests online. Get χ² statistic, p-value, Cramér's V, expected frequencies, and standardized residuals.

About the Chi-Square Test Calculator

The chi-square test is one of the most widely used statistical tests for categorical data. It comes in two main flavors: the goodness-of-fit test, which checks whether observed frequencies match an expected distribution, and the test of independence, which determines whether two categorical variables are associated in a contingency table.

This calculator handles both test types. For goodness of fit, enter your observed and expected frequencies. For independence, enter your contingency table data. The calculator returns the χ² statistic, p-value, degrees of freedom, Cramér's V effect size, expected frequencies, standardized residuals, and a clear accept/reject decision.

Chi-square tests appear everywhere: testing whether a die is fair, checking if genetics follow Mendelian ratios, analyzing survey cross-tabulations, evaluating A/B test results with categorical outcomes, and assessing whether customer preferences differ across demographic groups. Check the example with realistic values before reporting. Use the steps shown to verify rounding and units. Cross-check this output using a known reference case.

Why Use This Chi-Square Test Calculator?

Computing chi-square by hand involves squaring differences between observed and expected frequencies for every cell, summing them, looking up critical values in a table, and manually assessing significance. This calculator does it instantly for any table size, plus computes Cramér's V effect size and standardized residuals to identify which cells deviate most. It eliminates errors and lets you focus on interpreting the results.

How to Use This Calculator

Select your test type: Goodness of Fit or Test of Independence.
For goodness of fit: enter observed frequencies and expected frequencies (comma-separated), or use a preset.
For independence: enter the number of rows and columns, then the table data (rows separated by |).
Set your significance level alpha.
Review the χ² statistic, p-value, and decision.
Examine expected frequencies and standardized residuals to see which cells drive the result.
Check Cramér's V to assess the strength of association (independence test).

Formula

Chi-Square Statistic: χ² = Σ (Oᵢ − Eᵢ)² / Eᵢ Degrees of Freedom: Goodness of Fit: df = k − 1 Independence: df = (r − 1)(c − 1) Expected Frequency (Independence): Eᵢⱼ = (Row Total × Col Total) / Grand Total Cramér's V: V = √(χ² / (n × min(r−1, c−1))) Standardized Residual: e = (O − E) / √E

Example Calculation

Result: χ²(2) = 16.8269, p = 0.0002

A 2×3 contingency table with 200 total observations yields a chi-square statistic of 16.83 with 2 degrees of freedom. The p-value of 0.0002 is far below 0.05, indicating a significant association between the row and column variables. Cramér's V = 0.29, suggesting a medium effect size.

Tips & Best Practices

All expected frequencies should be at least 5 for the chi-square approximation to be reliable. If any expected count is below 5, consider Fisher's exact test.
Standardized residuals exceeding ±2 indicate cells that deviate substantially from expectation — these drive the significant result.
Chi-square tests are always right-tailed: large χ² values indicate poor fit or strong association.
Cramér's V ranges from 0 (no association) to 1 (perfect association). Cohen's benchmarks: 0.1 small, 0.3 medium, 0.5 large.
For 2×2 tables, Yates' continuity correction is sometimes applied to improve the chi-square approximation.
Chi-square tests require data in frequency counts, not proportions or percentages. Convert before entering.

Chi-Square Goodness of Fit in Practice

The goodness-of-fit test is the go-to method for checking whether observed data matches an expected probability distribution. Geneticists use it to verify Mendelian ratios (e.g., 9:3:3:1 in dihybrid crosses), quality engineers check if defect counts follow Poisson distributions, and pollsters verify if demographic samples match census proportions.

Understanding the Contingency Table

In a test of independence, each cell of the contingency table represents the frequency of a specific combination of categories. Expected frequencies are computed under the assumption of independence: E = (row total × column total) / grand total. Large deviations between observed and expected values in specific cells drive the chi-square statistic upward. Standardized residuals help pinpoint exactly which combinations are surprising.

Assumptions and Limitations

Chi-square tests assume independent observations (each subject contributes to exactly one cell), adequate expected frequencies (usually ≥ 5 in each cell), and a fixed total sample size. Violating these can lead to unreliable p-values. For matched or paired data, use McNemar's test. For tables with small expected counts, Fisher's exact test is more appropriate.

Frequently Asked Questions

What is the difference between goodness-of-fit and independence tests?

Goodness of fit compares one categorical variable's observed distribution against a theoretical expected distribution. Independence tests whether two categorical variables are related in a contingency table. Both use the same χ² formula but differ in degrees of freedom and interpretation.

What does a significant chi-square test mean?

For goodness of fit, it means the observed distribution significantly differs from the expected one. For independence, it means the two variables are associated (not independent). It does not tell you the direction or magnitude of the relationship — use residuals and Cramér's V for that.

When should I use Fisher's exact test instead?

When any expected cell frequency is below 5, or when your total sample size is small (under 20). Fisher's exact test computes the exact probability without relying on the chi-square distribution approximation.

Can chi-square detect the direction of association?

No. Chi-square only tells you whether an association exists, not its direction. Examine standardized residuals to see which specific cells contribute most and in which direction (positive = more than expected, negative = less).

What is Cramér's V?

Cramér's V is an effect size measure for chi-square tests of independence, ranging from 0 to 1. It's analogous to a correlation coefficient for categorical data and adjusts for table size.

Can I use chi-square for continuous data?

Not directly. Chi-square requires categorical frequency counts. If you have continuous data, you must first bin it into categories (e.g., age ranges). However, binning choices affect results, so parametric tests are usually preferred for continuous data.