Run a chi-square goodness of fit test or a test of independence on a contingency table. Enter your observed frequencies and get the chi-squared (χ²) statistic, degrees of freedom, p-value, Cramér’s V effect size, per-cell contributions, and a plain-English verdict — with full step-by-step working.
📌 Goodness of Fit: Test whether your observed counts match expected proportions for one categorical variable. Formula: χ² = ∑ (O − E)² / E df = k − 1
Category Name
Observed
Expected %
Expected % must sum to 100. Leave blank for equal distribution.
📌 Test of Independence: Test whether two categorical variables are associated. Enter observed counts in the contingency table. Formula: χ² = ∑ (O − E)² / E df = (rows − 1) × (cols − 1)
Chi-Square Statistic
—
📊 Per-Category / Per-Cell Contributions
📐 Step-by-Step Working
⚠️ Disclaimer: Chi-square tests require independent observations, adequate sample size (expected freq ≥ 5 per cell), and categorical data. Results are for educational and research support. Verify critical decisions with a qualified statistician.
Sources & Methodology
✓ Chi-square formulas verified against NIST/SEMATECH and OpenStax Statistics. P-values computed via chi-square distribution CDF using the regularized incomplete gamma function. Cramér’s V uses the standard formula with minimum(r−1, c−1) denominator.
Authoritative source for chi-square statistic formula, degrees of freedom, and p-value calculation method. NIST is a US federal government measurement standards body.
Open-access Rice University textbook. Source for test of independence, expected frequency formula E = (row total × column total) / grand total, and df = (r−1)(c−1).
Chi-Square Test — Goodness of Fit, Independence, Cramér’s V & Full Guide
The chi-square test (χ²) is a non-parametric statistical test for categorical data that compares observed frequencies to expected frequencies. It is one of the most widely used tests in statistics, covering two fundamentally different questions: whether data fits a theoretical distribution (goodness of fit) and whether two categorical variables are related (test of independence).
What is the Chi-Square Formula and How Does It Work?
The chi-square statistic measures the overall discrepancy between observed and expected counts. Large χ² means the observed data differs substantially from what we expected under the null hypothesis.
χ² = ∑ (Oᵢ − Eᵢ)² / Eᵢ
Goodness of Fit worked example (fair die test):
Roll a die 100 times. Expected: 100/6 = 16.67 per face. Observed: [18, 12, 22, 15, 20, 13]
χ² = (18−16.67)²/16.67 + (12−16.67)²/16.67 + (22−16.67)²/16.67 + ...
= 0.107 + 1.307 + 1.707 + 0.167 + 0.667 + 0.807 = 4.76
df = 6−1 = 5 | p ≈ 0.446 | p > 0.05 → Fail to reject H0: die appears fair
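The working above fits in a few lines of code. Below is a minimal stdlib-only Python sketch (not this calculator’s actual implementation): the p-value is obtained from the power series for the regularized lower incomplete gamma function, the same method the methodology notes describe.

```python
import math

def chi2_sf(x, df):
    """Survival function (p-value) of the chi-square distribution:
    1 - P(df/2, x/2), where P is the regularized lower incomplete
    gamma function, evaluated here by its power series."""
    a, t = df / 2.0, x / 2.0
    if t == 0:
        return 1.0
    term = t ** a * math.exp(-t) / math.gamma(a + 1)
    total, n = term, 0
    while term > 1e-15 * total:
        n += 1
        term *= t / (a + n)   # next series term: multiply by t/(a+n)
        total += term
    return 1.0 - total

# Fair-die example: 100 rolls, equal expected counts per face
observed = [18, 12, 22, 15, 20, 13]
expected = [100 / 6] * 6
chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1
p = chi2_sf(chi2, df)
print(round(chi2, 2), df, round(p, 3))  # χ² ≈ 4.76, df = 5, p ≈ 0.446
```

The series converges quickly for the degrees of freedom typical of contingency tables; production code would fall back to a continued-fraction expansion for large χ²/df ratios.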
Chi-Square Test of Independence — Contingency Tables Explained
The independence test uses a contingency table (cross-tabulation) to test whether two categorical variables are related. The expected frequency for each cell is: E = (row total × column total) / grand total.
Independence test example (gender vs. product preference):
n = 150 people. Observed: Male(A=30, B=20, C=10) | Female(A=20, B=40, C=30)
Row totals: Males=60, Females=90. Col totals: A=50, B=60, C=40. Grand N=150
E(Male,A) = (60×50)/150 = 20. E(Male,B) = (60×60)/150 = 24. E(Male,C) = (60×40)/150 = 16
χ² = (30−20)²/20 + (20−24)²/24 + (10−16)²/16 + (20−30)²/30 + (40−36)²/36 + (30−24)²/24 = 5.00 + 0.67 + 2.25 + 3.33 + 0.44 + 1.50 = 13.19
df = (2−1)×(3−1) = 2 | p ≈ 0.0014 | p < 0.05 → Gender and product preference are associated
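The same computation can be sketched in a few lines of Python, assuming the observed counts above; the expected table comes straight from E = (row total × column total) / grand total.

```python
# Observed counts for the worked example (rows: Male, Female; cols: A, B, C)
observed = [[30, 20, 10],
            [20, 40, 30]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
n = sum(row_totals)

# Expected frequency per cell: E = (row total × column total) / grand total
expected = [[r * c / n for c in col_totals] for r in row_totals]

chi2 = sum((o - e) ** 2 / e
           for o_row, e_row in zip(observed, expected)
           for o, e in zip(o_row, e_row))
df = (len(observed) - 1) * (len(observed[0]) - 1)
print(round(chi2, 2), df)  # χ² ≈ 13.19, df = 2
```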
Cramér’s V — Effect Size for Chi-Square Independence Tests
Statistical significance tells you IF an association exists. Cramér’s V tells you HOW STRONG it is. Unlike χ², which grows with sample size for a given strength of association, Cramér’s V is normalized by N and always ranges from 0 to 1.
Cramér’s V = √( χ² / (N × min(r−1, c−1)) )
Convention for interpretation (Cohen): V = 0.10 small | V = 0.30 medium | V = 0.50 large
Cramér’s V = 0 means no association. V = 1 means perfect association.
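The formula translates directly to code. A one-function sketch, plugged with the χ² and table dimensions from the worked example above:

```python
import math

def cramers_v(chi2, n, rows, cols):
    """Cramér's V = sqrt(chi2 / (n * min(r-1, c-1)))."""
    return math.sqrt(chi2 / (n * min(rows - 1, cols - 1)))

# Gender × preference example: χ² ≈ 13.19, N = 150, 2×3 table
v = cramers_v(13.1944, 150, 2, 3)
print(round(v, 3))  # ≈ 0.297 → just under Cohen's "medium" benchmark
```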
Chi-Square Reference Table: Critical Values
df | α=0.10 | α=0.05 | α=0.01 | Common Use Case
1 | 2.706 | 3.841 | 6.635 | 2-category GOF, 2×2 table
2 | 4.605 | 5.991 | 9.210 | 3-category GOF, 2×3 table
3 | 6.251 | 7.815 | 11.345 | 4-category GOF, 2×4 table
4 | 7.779 | 9.488 | 13.277 | 5-category GOF, 2×5 / 3×3 table
5 | 9.236 | 11.070 | 15.086 | 6-category GOF, fair die test
6 | 10.645 | 12.592 | 16.812 | 3×4 contingency table
9 | 14.684 | 16.919 | 21.666 | 4×4 contingency table
12 | 18.549 | 21.026 | 26.217 | 4×5 contingency table
💡 Expected frequency rule: All expected cell frequencies must be at least 5 for the chi-square approximation to be valid. If any cell has expected frequency < 5: combine categories to increase counts, use Fisher’s exact test for 2×2 tables with small n, or collect more data. A warning appears in this calculator when this assumption is violated.
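The rule is easy to check programmatically. A small sketch (the function name and threshold parameter are illustrative, not this calculator’s internals):

```python
def check_expected_frequencies(expected, threshold=5.0):
    """Return (row, col, value) for every cell whose expected
    frequency falls below the threshold; empty list means the
    chi-square approximation's sample-size assumption holds."""
    return [(i, j, e)
            for i, row in enumerate(expected)
            for j, e in enumerate(row)
            if e < threshold]

# Example: one low-count cell triggers a warning
expected = [[20.0, 24.0, 16.0],
            [30.0, 36.0, 4.2]]
print(check_expected_frequencies(expected))  # [(1, 2, 4.2)]
```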
Frequently Asked Questions
The chi-square test is a hypothesis test for categorical data comparing observed to expected frequencies. Two types: (1) Goodness of fit — one categorical variable tested against a theoretical distribution (is this die fair?). (2) Test of independence — two categorical variables in a cross-table to test if they’re related (is gender related to job preference?). Requirements: count data, independent observations, expected freq ≥ 5 per cell.
χ² = ∑ (O − E)² / E. Sum across all categories (GOF) or all cells (independence). O = observed count. E = expected count. For GOF: E = total_n × expected_proportion. For independence: E = (row total × col total) / grand total. The statistic measures total discrepancy between observed and expected. Larger χ² = bigger discrepancy = smaller p-value.
Goodness of fit: df = k − 1 (k = number of categories). Example: 4 categories → df = 3. Test of independence: df = (rows − 1) × (columns − 1). Example: 3×4 table → df = 2×3 = 6. Degrees of freedom determine which chi-square distribution to use for the p-value. More df = heavier-tailed distribution = need larger χ² for significance.
Cramér’s V = √(χ² / (N × min(r−1, c−1))). Ranges 0 to 1. V near 0 = no association. V = 1 = perfect association. Cohen conventions: <0.10 negligible, 0.10–0.30 small, 0.30–0.50 medium, ≥0.50 large. Unlike χ², V doesn’t increase with sample size, making it ideal for comparing association strength across different studies or sample sizes.
All expected cell frequencies must be ≥ 5 for the chi-square approximation to be valid. If expected freq < 5: (1) combine adjacent categories to increase counts; (2) use Fisher’s exact test for 2×2 tables; (3) collect more data. The chi-square distribution approximation breaks down with very small expected frequencies, leading to inflated false positive rates. This calculator warns you when this rule is violated.
Goodness of fit: one categorical variable vs. theoretical distribution. You specify expected proportions. Examples: fair coin, Mendelian ratios, uniform distribution. Test of independence: two categorical variables in a cross-table. Expected frequencies calculated from the data itself (row × col / n). Examples: gender vs. preference, treatment vs. outcome. Different df formulas: GOF uses k−1; independence uses (r−1)(c−1).
GOF null: the observed distribution matches the expected distribution (the data fits the theoretical model). Alternative: it doesn’t fit. Independence null: the two categorical variables are independent — knowing one tells you nothing about the other. Alternative: the variables are associated. If p < α you reject the null hypothesis. Chi-square tests are always two-sided — you test for any deviation from the null, not a specific direction.
There are four main assumptions: (1) Independence — each observation belongs to exactly one cell; no repeated measures. If violated, use McNemar’s test. (2) Expected frequency ≥ 5 in all cells — combine categories if needed. (3) Categorical data — counts, not means or percentages. (4) Random sampling. Also: chi-square should not be applied to proportions directly — convert to counts first.
Per-cell contribution = (O − E)² / E. It shows which specific categories or cells are driving a significant result. Cells with large contributions (> 2) are the main sources of deviation from the null hypothesis. Standardized residual = (O − E) / √E. Values beyond ±2 indicate cells that are significantly higher or lower than expected. This calculator shows both, helping you interpret not just WHETHER the test is significant but WHERE the differences are.
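Both diagnostics are one-liners per cell. A short sketch, using the observed and expected counts from the gender × preference example earlier on this page:

```python
import math

# Per-cell diagnostics: contribution = (O − E)² / E,
# standardized residual = (O − E) / √E
observed = [30, 20, 10, 20, 40, 30]   # Male A,B,C then Female A,B,C
expected = [20, 24, 16, 30, 36, 24]   # from E = row total × col total / N

for o, e in zip(observed, expected):
    contribution = (o - e) ** 2 / e
    residual = (o - e) / math.sqrt(e)
    flag = " <- beyond ±2" if abs(residual) > 2 else ""
    print(f"O={o:2d}  E={e:2d}  contribution={contribution:5.2f}  "
          f"residual={residual:+5.2f}{flag}")
```

In this example only the Male/A cell has a residual beyond ±2 (+2.24), identifying it as the main driver of the significant result.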
Classic examples: Fair die test — roll a die 120 times, expect 20 per face, compare to observed. Genetics — Mendelian inheritance predicts 3:1 phenotype ratio; observed progeny tested against this. Customer distribution — test if customers are equally distributed across weekdays. Marketing — test if survey responses match hypothesized demographic breakdown. Any situation comparing one set of observed counts to hypothesized proportions.
No. Chi-square tests only detect association or difference, not causation. A significant test of independence means the two variables are related — not that one causes the other. Confounding variables may explain the association. To establish causation you need experimental design with random assignment. Chi-square is also sensitive to sample size: with very large n, even trivial associations become statistically significant. Always report effect size (Cramér’s V) alongside p-values.
Medicine: testing if treatment group (drug/placebo) is independent of outcome (improved/not improved). Testing if smoking status is associated with lung disease diagnosis. Clinical trials: comparing observed adverse event rates to expected rates. Biology: Mendelian genetics testing observed vs expected offspring ratios. Hardy-Weinberg equilibrium testing. Ecology: testing if species distribution across habitats differs from expected. Any study with two categorical variables measured on independent subjects.
Use Fisher’s exact test when you have a 2×2 contingency table AND any expected cell frequency is < 5, or when your total sample size n < 20. Fisher’s exact test computes the exact probability directly without relying on the chi-square approximation, so it is valid for small samples. For larger tables with small expected frequencies, consider combining categories or collecting more data before running chi-square.
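For reference, the 2×2 exact test is small enough to sketch with the standard library alone. This hypothetical fisher_exact_2x2 helper enumerates every table with the observed margins and sums the probabilities of those no more likely than the observed one — the usual two-sided convention, though statistical packages may handle ties slightly differently.

```python
from math import comb

def fisher_exact_2x2(a, b, c, d):
    """Two-sided Fisher's exact test for the 2×2 table [[a, b], [c, d]].
    Each table with the same margins has hypergeometric probability
    C(r1, k) * C(r2, c1 - k) / C(n, c1); sum those <= the observed one."""
    r1, r2 = a + b, c + d
    c1, n = a + c, a + b + c + d
    denom = comb(n, c1)
    p_obs = comb(r1, a) * comb(r2, c1 - a) / denom
    p = 0.0
    for k in range(max(0, c1 - r2), min(r1, c1) + 1):
        p_k = comb(r1, k) * comb(r2, c1 - k) / denom
        if p_k <= p_obs * (1 + 1e-9):  # small tolerance for float ties
            p += p_k
    return p

# Classic "lady tasting tea" table: [[3, 1], [1, 3]]
print(round(fisher_exact_2x2(3, 1, 1, 3), 4))  # 0.4857
```

Enumeration is exact but exponential in table size, which is why Fisher’s test is usually reserved for 2×2 tables; larger sparse tables call for category merging or more data, as noted above.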