Statistics calculators give researchers, analysts, and students the numbers behind data, but statistics is also a field where systematic misinterpretation is routine even among trained professionals. In one JAMA survey, 88% of medical residents expressed confidence in their understanding of p-values, yet 100% misinterpreted them. The p-value is not the probability the null hypothesis is true. Statistical significance is not practical significance. And dividing by N instead of N−1 when calculating sample standard deviation produces a biased estimate that understates the true population variance.
Statistics is both a discipline and, when the underlying concepts are unclear, a reliable source of systematic error. Standard deviation has two versions (population and sample) that differ by whether you divide by N or N−1, and the N−1 correction exists for a specific mathematical reason that matters for small samples. P-values measure one very specific thing and are routinely misinterpreted as measuring something else entirely. And sample size has a quadratic relationship with precision: halving your margin of error requires four times as many observations. The percentile calculator, quartile calculator, and sample size calculator cover the mechanical calculations; this page covers the conceptual context that makes those calculations interpretable.
Standard deviation measures how spread out data values are from the mean. Two versions exist. Population standard deviation (σ) divides the sum of squared deviations by N, the full count of data points; this is correct when you have every member of the population you care about. Sample standard deviation (s) divides by N−1; this is used when your data is a sample drawn from a larger population and you want to estimate the population's standard deviation. The N−1 denominator is Bessel's correction, named after Friedrich Bessel. The reason: the sample mean is calculated from the same data points, so those points are systematically closer to their own sample mean than they would be to the true population mean. Dividing by N therefore produces a variance that systematically underestimates the true population variance; dividing by N−1 corrects the bias. For large samples (N = 100), the difference is about 1% and negligible. For small samples (N = 5), the variance from dividing by N is 20% smaller than the variance from dividing by N−1, a meaningful bias.
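A minimal sketch of that gap using Python's standard library (`statistics.pstdev` divides by N, `statistics.stdev` by N−1); the five-point dataset is invented for illustration:

```python
import statistics

# Hypothetical five-point sample; small N makes Bessel's correction visible.
data = [4.0, 7.0, 9.0, 12.0, 18.0]

pop_sd = statistics.pstdev(data)   # divides squared deviations by N
samp_sd = statistics.stdev(data)   # divides by N-1 (Bessel's correction)

print(f"population SD (/N):   {pop_sd:.3f}")
print(f"sample SD (/(N-1)):   {samp_sd:.3f}")
# Variance ratio is (N-1)/N = 4/5: the /N variance is 20% smaller,
# and the /N standard deviation is sqrt(4/5), about 10.6% smaller.
print(f"variance ratio: {pop_sd**2 / samp_sd**2:.2f}")  # -> 0.80
```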
A p-value is the probability of observing data as extreme as or more extreme than your result, assuming the null hypothesis is true. Nothing more, nothing less. P = 0.04 means: if there were truly no effect (the null hypothesis), there would be only a 4% chance of seeing data this extreme or more extreme purely by chance. The most common error is treating the p-value as the probability the null hypothesis is true, or the probability the result is real; this is the "inverse probability fallacy," and those are completely different quantities. A p = 0.04 does not mean there is a 96% chance the result is real. It means the data would be observed by chance only 4% of the time if there were no effect.
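To make that definition concrete, here is a hedged simulation sketch: under a true null (two groups of 20 drawn from the same normal distribution), we count how often random sampling alone produces a mean difference at least as extreme as a hypothetical observed one. The group size, noise model, and observed difference of 0.65 are invented for illustration:

```python
import random
import statistics

random.seed(42)

N_PER_GROUP = 20
N_SIMS = 20_000
OBSERVED = 0.65  # hypothetical observed mean difference

def null_mean_diff() -> float:
    """Mean difference between two groups when the null is exactly true."""
    a = [random.gauss(0, 1) for _ in range(N_PER_GROUP)]
    b = [random.gauss(0, 1) for _ in range(N_PER_GROUP)]
    return statistics.mean(a) - statistics.mean(b)

# Two-sided p-value: fraction of null-world results at least this extreme.
extreme = sum(abs(null_mean_diff()) >= OBSERVED for _ in range(N_SIMS))
print(f"simulated p ~ {extreme / N_SIMS:.3f}")  # ~ 0.04
# This estimates P(data this extreme | null true),
# NOT P(null true | data) -- the inverse probability fallacy.
```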
Statistical significance (p < 0.05) tells you a result is unlikely to be noise. Effect size (Cohen's d, r, eta-squared) tells you how large the effect is. With large enough samples, even trivially small effects become statistically significant. A study with n = 500 participants finding p < 0.001 for a meditation intervention sounds compelling. If Cohen's d = 0.10, the standardised effect is 0.1 standard deviations; on a 100-point scale with an SD of 15, that is a 1.5-point change. Statistically significant, practically negligible. Cohen's benchmarks: small effect = d ≈ 0.2, medium = d ≈ 0.5, large = d ≈ 0.8. The APA Publication Manual now requires effect sizes to be reported alongside p-values for this reason. A p-value without an effect size is an incomplete statistical report.
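Computing an effect size is mechanical once the groups are in hand. Here is a minimal Cohen's d sketch for two independent groups using the pooled-SD formula; the test scores are invented:

```python
import statistics

def cohens_d(group_a: list, group_b: list) -> float:
    """Cohen's d for independent groups, using the pooled standard deviation."""
    na, nb = len(group_a), len(group_b)
    va, vb = statistics.variance(group_a), statistics.variance(group_b)  # N-1
    pooled_sd = (((na - 1) * va + (nb - 1) * vb) / (na + nb - 2)) ** 0.5
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

# Hypothetical 100-point test scores for two small groups.
control   = [62, 71, 55, 68, 74, 59, 66, 70, 64, 58]
treatment = [64, 72, 57, 69, 75, 61, 67, 71, 65, 60]

print(f"d = {cohens_d(treatment, control):.2f}")  # ~0.23, "small" by Cohen's benchmarks
```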
P-value misinterpretation is the norm, not the exception, even among experts. A seminal JAMA survey (Goodman 2008) tested medical residents on p-value interpretation: 88% expressed fair to complete confidence in their understanding, and 100% got the interpretation wrong. The most common errors: treating the p-value as the probability the null is true, treating it as the probability the result will replicate, and conflating statistical significance with practical importance. The American Statistical Association issued a formal statement in 2016 clarifying that a p-value below 0.05 does not by itself constitute adequate evidence for a scientific claim. The replication crisis in psychology, medicine, and social science is partly traceable to over-reliance on p < 0.05 without consideration of effect size, study power, and pre-registration.
Cohen’s benchmarks are guidelines for effect interpretation in the absence of domain-specific context. A d = 0.2 is “small” in abstract terms but may be very meaningful if the intervention is cheap and safe. A d = 0.8 may be meaningful in education but insufficient in medical treatment contexts. Always interpret effect size relative to the domain and practical stakes.
| Cohen’s d | Classification | Distribution Overlap | Practical Example |
|---|---|---|---|
| 0.10 | Negligible | ~92% overlap | 1.5-pt change on 100-pt scale (often noise) |
| 0.20 | Small | ~85% overlap | Height difference between 15–16 year olds |
| 0.50 | Medium | ~67% overlap | IQ difference between clerical and semi-skilled workers |
| 0.80 | Large | ~53% overlap | IQ difference between PhD and typical college freshman |
| 1.20 | Very Large | ~37% overlap | Substantial clinically meaningful difference |
| 2.0+ | Huge | <22% overlap | Group differences clearly visible without statistics |
The table below assumes you are estimating a proportion at 95% confidence with maximum variance (p = 0.5). To halve the margin of error, you need four times the sample. This diminishing return is why large surveys require careful cost-benefit analysis: going from ±5% to ±2.5% costs 4× more but delivers only 2× the precision. A minimal sketch of the underlying formula follows the table.
| Sample Size (n) | Margin of Error | To Halve MOE | Notes |
|---|---|---|---|
| 100 | ±9.8% | → need n=400 | Rough estimates only |
| 385 | ±5.0% | → need n=1,537 | Common survey standard |
| 600 | ±4.0% | → need n=2,401 | Political polling minimum |
| 1,067 | ±3.0% | → need n=4,268 | National survey standard |
| 1,537 | ±2.5% | → need n=6,147 | High-precision surveys |
| 9,604 | ±1.0% | → need n=38,416 | Census-level precision |
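Here is that sketch, assuming a proportion estimate at 95% confidence with worst-case p = 0.5 (z ≈ 1.96):

```python
import math

Z_95 = 1.96  # two-sided critical value for 95% confidence
P = 0.5      # worst-case proportion (maximum variance)

def margin_of_error(n: int) -> float:
    """Margin of error for a proportion estimate at 95% confidence."""
    return Z_95 * math.sqrt(P * (1 - P) / n)

def required_n(moe: float) -> int:
    """Sample size needed for a target margin of error (rounded up)."""
    return math.ceil(Z_95**2 * P * (1 - P) / moe**2)

print(f"n=385   -> MOE of +/-{margin_of_error(385):.1%}")  # ~ +/-5.0%
print(f"+/-2.5% -> n={required_n(0.025)}")                 # 1537: 4x the sample
```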
A z-score measures how many standard deviations a data point is from the mean. Positive z-scores are above average; negative are below. Z-scores are used for normalising datasets and for calculating percentile rank from a known distribution; a short conversion sketch follows the table below.
| Z-Score | Percentile Rank | Meaning |
|---|---|---|
| −3.0 | 0.13% | Extreme low — rarer than 1 in 750 |
| −2.0 | 2.28% | Low — bottom 2.3% |
| −1.0 | 15.87% | Below average |
| 0.0 | 50.00% | Exactly at the mean |
| +1.0 | 84.13% | Above average |
| +1.65 | 95.05% | Top 5% threshold (one-tailed) |
| +1.96 | 97.50% | 95% confidence interval boundary |
| +2.0 | 97.72% | Top 2.3% |
| +2.576 | 99.50% | 99% confidence interval boundary |
| +3.0 | 99.87% | Top 0.13% — rare |
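A minimal conversion sketch using `statistics.NormalDist` from the Python standard library (3.8+); the raw score, mean, and SD are invented:

```python
from statistics import NormalDist

std_normal = NormalDist()  # standard normal: mean 0, SD 1

def z_score(x: float, mean: float, sd: float) -> float:
    return (x - mean) / sd

# Hypothetical score from a distribution with mean 100 and SD 15.
z = z_score(124.6, mean=100, sd=15)
percentile = std_normal.cdf(z) * 100
print(f"z = {z:.2f}, percentile rank ~ {percentile:.1f}")  # z = 1.64, ~94.9

# Inverse lookup: the z-score marking the top 5% (one-tailed).
print(f"top-5% threshold: z = {std_normal.inv_cdf(0.95):.3f}")  # ~1.645
```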
Statistical significance ≠ practical significance, the most consequential misunderstanding in applied statistics: with large enough samples, any effect, no matter how small, will be statistically significant. A randomised trial with 10,000 participants comparing two teaching methods might find p < 0.0001, massively significant. If the effect is d = 0.05, that is approximately a 0.75-point improvement on a 100-point test (assuming an SD of 15). The finding is real (not noise), replicable, and practically meaningless for most policy decisions. Conversely, a study with n = 30 finding p = 0.08 and d = 0.65 has a genuine medium effect that failed to reach significance because the study was underpowered; the failure to reach p < 0.05 is a failure of sample size, not a failure of the effect. The ASA and APA both now call for effect sizes alongside p-values precisely because p-values alone cannot distinguish "real but tiny" from "real and meaningful."
Use the standard deviation calculator for any dataset where you want to understand spread. Enter the data and specify whether you want population standard deviation (you have all the data you care about) or sample standard deviation (your data is a sample and you want to estimate the population). For most research and survey contexts, use sample standard deviation (N-1). Use the ascending order calculator as a first step before calculating median, quartiles, and percentiles — all of these require sorted data. Use the percentile calculator to find where a specific value ranks in a distribution, and the quartile calculator to find Q1, Q2, Q3, and IQR for box plot construction and outlier detection.
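As a companion to the quartile calculator, here is a minimal sketch of the 1.5×IQR outlier rule using `statistics.quantiles`; the dataset is invented, and note that quartile conventions vary (`method="inclusive"` matches the hinge-style definition many calculators use):

```python
import statistics

# Hypothetical dataset with one suspiciously large value.
data = sorted([12, 15, 14, 10, 18, 16, 13, 11, 17, 42])

q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1
low_fence, high_fence = q1 - 1.5 * iqr, q3 + 1.5 * iqr

print(f"Q1={q1}, median={q2}, Q3={q3}, IQR={iqr}")
outliers = [x for x in data if x < low_fence or x > high_fence]
print(f"fences: [{low_fence}, {high_fence}], outliers: {outliers}")  # [42]
```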
When interpreting any p-value result: read the p-value correctly (the probability of data this extreme given that the null is true), report the effect size alongside it (Cohen's d for means, r for correlations, odds ratio for categorical outcomes), and consider study power. A non-significant p-value does not mean "no effect"; it means "insufficient evidence to reject the null at this significance level." An underpowered study can miss a real and meaningful effect. Always run a power analysis before data collection to ensure your sample size is large enough to detect the effect size you consider meaningful.
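A minimal power-analysis sketch using the normal approximation for a two-sided, two-sample comparison (n per group ≈ 2(z₁₋α/₂ + z₁₋β)² / d²); the exact t-based answer is slightly larger for small samples:

```python
from math import ceil
from statistics import NormalDist

def n_per_group(d: float, alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate per-group n to detect effect size d (two-sided test)."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)  # 1.96 for alpha = 0.05
    z_beta = z.inv_cdf(power)           # 0.842 for 80% power
    return ceil(2 * (z_alpha + z_beta) ** 2 / d**2)

for d in (0.2, 0.5, 0.8):
    print(f"d = {d}: n ~ {n_per_group(d)} per group")
# d = 0.2 -> 393, d = 0.5 -> 63, d = 0.8 -> 25 (normal approximation)
```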
Use the sample size calculator before designing any survey or study. Enter your required confidence level (95% is standard), desired margin of error (5% for general surveys, 3% for policy-relevant research), and expected proportion (use 0.5 if unknown — this maximises the required sample). Remember the quadratic relationship: cutting margin of error in half requires four times the sample. Population size has minimal impact on required sample size for large populations (a nationally representative sample of 385 achieves ±5% margin of error whether the population is 100,000 or 100 million) — this counterintuitive result confuses many people who assume larger populations always require larger samples.
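The population-size claim can be checked with the finite population correction, n_adj = n₀ / (1 + (n₀ − 1) / N); a minimal sketch:

```python
import math

def fpc_adjusted_n(n0: int, population: int) -> int:
    """Apply the finite population correction to an infinite-population n."""
    return math.ceil(n0 / (1 + (n0 - 1) / population))

n0 = 385  # +/-5% at 95% confidence, p = 0.5
for pop in (10_000, 100_000, 100_000_000):
    print(f"population {pop:>11,}: n = {fpc_adjusted_n(n0, pop)}")
# 10,000 -> 371; 100,000 -> 384; 100,000,000 -> 385
```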
Three statistical errors are pervasive across fields. First: p-hacking, running multiple analyses and reporting only the one that crosses p < 0.05. Each additional analysis increases the chance of false positives; with 20 independent tests at α = 0.05, the expected number of false positives is one, and the probability of at least one is about 64% (1 − 0.95²⁰), even if every null hypothesis is true. Second: treating a non-significant result as evidence of no effect, especially from an underpowered study. Absence of evidence is not evidence of absence when the study had insufficient power to detect a meaningful effect. Third: using N instead of N−1 for sample standard deviation on small datasets, which systematically understates spread and biases every downstream calculation, including confidence intervals and t-tests, that depends on the standard deviation estimate.
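The p-hacking arithmetic is easy to verify by simulation; a minimal sketch in which every null hypothesis is true, so each test rejects with probability α regardless of the data model:

```python
import random

random.seed(0)
ALPHA, N_TESTS, N_EXPERIMENTS = 0.05, 20, 100_000

# Under a true null, p-values are uniform on [0, 1], so a uniform draw
# below ALPHA stands in for a "significant" result.
hits = sum(
    any(random.random() < ALPHA for _ in range(N_TESTS))
    for _ in range(N_EXPERIMENTS)
)
print(f"P(at least one false positive) ~ {hits / N_EXPERIMENTS:.3f}")
print(f"analytic: 1 - 0.95**20 = {1 - 0.95**20:.3f}")  # 0.642
```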