What is a p-value and how do I interpret it?

A p-value is the probability of observing results at least as extreme as the data assuming the null hypothesis is true. A small p-value like less than 0.05 provides evidence against the null hypothesis. It is NOT the probability that the null hypothesis is true.

Which distribution should I use for my test?

Use Z for large samples with known sigma. Use t for small samples with unknown sigma. Use chi-square for goodness of fit and independence tests. Use F for comparing variances or ANOVA.

What are common misconceptions about p-values?

Common mistakes include thinking p is the probability H0 is true or that 1 minus p is the probability H1 is true or that a non-significant result proves no effect exists. The p-value only measures evidence against H0 under the assumption H0 is true.

P-Value Calculator

Z · t · χ² · F tests · distribution shading

Test Type

Tail Type

Z-Score

Standard normal test statistic

Quick Examples

Result

📊

Enter parameters and click Calculate

Compute p-values for Z, T, Chi-square, or F test statistics.

Distribution Visualization

Python Compiler

What Is a P-Value?

A p-value is the probability of observing a test statistic at least as extreme as the one computed from sample data, assuming the null hypothesis (H₀) is true. It is a fundamental concept in hypothesis testing and helps researchers decide whether to reject H₀.

📏

Probability Measure

The p-value quantifies how likely observed data (or more extreme) would occur if H₀ were true. Ranges from 0 to 1.

⚖️

Evidence Against H₀

A smaller p-value provides stronger evidence against the null hypothesis. It does NOT measure the probability that H₀ is true.

🎯

Decision Tool

Compare p-value to significance level α. If p ≤ α, reject H₀. Common thresholds: 0.05, 0.01, 0.10.

P-Value Interpretation Guide

The significance of a p-value depends on the chosen α level. Below are commonly used thresholds and their interpretations.

p < 0.001

Highly Significant — Very strong evidence against the null hypothesis. Often denoted with *** in research papers.

p < 0.05

Significant — Sufficient evidence to reject H₀ at the standard 5% level. The most common threshold in science.

p < 0.10

Marginal — Weak evidence against H₀. Sometimes called “trending toward significance.” Use with caution.

p ≥ 0.10

Not Significant — Insufficient evidence to reject H₀. Does not prove H₀ is true, only that data is consistent with it.

Test Distributions

Z-Test: p = P(Z ≥ |z|) for standard normal. Used when σ is known or n is large (≥30).

T-Test: p = P(T ≥ |t|) with df = n − 1. Used when σ is unknown and sample is small.

Chi-Square: p = P(χ² ≥ x²) with df = k − 1. Used for goodness-of-fit and independence tests.

F-Test: p = P(F ≥ f) with df₁ and df₂. Used for ANOVA and comparing two variances.

Distribution	Parameters	Common Use
Z (Normal)	None (standard)	Large samples, known σ, proportions
t (Student’s)	df = n − 1	Small samples, unknown σ
χ² (Chi-square)	df = k − 1	Categorical data, goodness-of-fit
F (Fisher)	df₁, df₂	ANOVA, variance comparison

One-Tailed vs Two-Tailed Tests

One-Tailed (Directional)

Tests for an effect in a specific direction (e.g., mean is greater than or less than a value). All α is concentrated in one tail, giving more power for that direction.

Two-Tailed (Non-directional)

Tests for any difference in either direction. The α is split between both tails (α/2 each). More conservative but appropriate when the direction is not pre-specified.

Important: The choice between one-tailed and two-tailed tests must be made before looking at the data. Choosing after seeing results inflates the false positive rate.

Common Misconceptions About P-Values

❌ “P is the probability H₀ is true”

Incorrect. The p-value is the probability of the data (or more extreme) given H₀ is true. It is P(data | H₀), not P(H₀ | data). Bayesian methods are needed for the latter.

❌ “1 − p = probability H₁ is true”

Incorrect. A p-value of 0.03 does not mean there is a 97% chance the alternative is true. The p-value tells you about the data under H₀, not the probability of any hypothesis.

❌ “Not significant = no effect”

Incorrect. Failure to reject H₀ does not prove H₀. It may simply mean the sample size was too small to detect the effect. “Absence of evidence is not evidence of absence.”

✅ Correct Interpretation

Correct: “Assuming H₀ is true, the probability of observing a test statistic as extreme as or more extreme than the one observed is p.” Always interpret in context.

Frequently Asked Questions

A p-value is the probability of observing results at least as extreme as the data, assuming the null hypothesis is true. A small p-value (e.g., less than 0.05) provides evidence against the null hypothesis. It is not the probability that the null hypothesis is true.

Use a one-tailed test when you have a directional hypothesis (e.g., the mean is greater than a value). Use two-tailed when you test for any difference in either direction. The choice must be made before looking at the data.

Use Z for large samples with known σ. Use t for small samples with unknown σ. Use chi-square for goodness-of-fit and independence tests. Use F for comparing variances or ANOVA.

The most common is α = 0.05 (95% confidence). For high-stakes decisions, use 0.01. For exploratory research, 0.10 is sometimes used. The threshold should be chosen before conducting the test.

No. Statistical significance does not equal practical significance. A large sample can produce a tiny p-value for a trivially small effect. Always report effect sizes alongside p-values to assess real-world importance.

Common mistakes include thinking p is the probability H₀ is true, or that 1 − p is the probability H₁ is true, or that a non-significant result proves no effect exists. The p-value only measures evidence against H₀ under the assumption H₀ is true.

Support This Free Tool

Every coffee helps keep the servers running. Every book sale funds the next tool I'm dreaming up. You're not just supporting a site — you're helping me build what developers actually need.

500K+ users

200+ tools

100% private

☕ One-time support Buy me a coffee 📚 Learn & support 9-Book Bundle - $9 𝕏 Stay updated Follow @anish2good

Privacy Guarantee: Private keys you enter or generate are never stored on our servers. All tools are served over HTTPS.

Menu

⭐ Popular Tools

🕒 Recently Used

📁 All Categories

Quick Links

Support

P-Value Calculator

Result

Enter parameters and click Calculate

Distribution Visualization

Python Compiler

What Is a P-Value?

Probability Measure

Evidence Against H₀

Decision Tool

P-Value Interpretation Guide

p < 0.001

p < 0.05

p < 0.10

p ≥ 0.10

Test Distributions

One-Tailed vs Two-Tailed Tests

One-Tailed (Directional)

Two-Tailed (Non-directional)

Common Misconceptions About P-Values

❌ “P is the probability H₀ is true”

❌ “1 − p = probability H₁ is true”

❌ “Not significant = no effect”

✅ Correct Interpretation

Frequently Asked Questions

Support This Free Tool

Result

Enter parameters and click Calculate

Distribution Visualization

Python Compiler

Related Tools

What Is a P-Value?

Probability Measure

Evidence Against H₀

Decision Tool

P-Value Interpretation Guide

p < 0.001

p < 0.05

p < 0.10

p ≥ 0.10

Test Distributions

One-Tailed vs Two-Tailed Tests

One-Tailed (Directional)

Two-Tailed (Non-directional)

Common Misconceptions About P-Values

❌ “P is the probability H₀ is true”

❌ “1 − p = probability H₁ is true”

❌ “Not significant = no effect”

✅ Correct Interpretation

Frequently Asked Questions

Support This Free Tool