P-Value Calculator

Z & T Tests Chi-Square & F One & Two-Tailed Free · No Signup

Free online p-value calculator for Z-test, T-test, Chi-square, and F-test. One-tailed and two-tailed tests with significance classification, interactive distribution curve, and Python scipy export.

P-Value Calculator
Standard normal test statistic

Result

📊

Enter parameters and click Calculate

Compute p-values for Z, T, Chi-square, or F test statistics.

Distribution Visualization

Python Compiler

What Is a P-Value?

A p-value is the probability of observing a test statistic at least as extreme as the one computed from sample data, assuming the null hypothesis (H₀) is true. It is a fundamental concept in hypothesis testing and helps researchers decide whether to reject H₀.

Fail to reject H₀ p/2 p/2 −z* +z* 0
📏

Probability Measure

The p-value quantifies how likely observed data (or more extreme) would occur if H₀ were true. Ranges from 0 to 1.

⚖️

Evidence Against H₀

A smaller p-value provides stronger evidence against the null hypothesis. It does NOT measure the probability that H₀ is true.

🎯

Decision Tool

Compare p-value to significance level α. If p ≤ α, reject H₀. Common thresholds: 0.05, 0.01, 0.10.

P-Value Interpretation Guide

The significance of a p-value depends on the chosen α level. Below are commonly used thresholds and their interpretations.

p < 0.001

Highly Significant — Very strong evidence against the null hypothesis. Often denoted with *** in research papers.

p < 0.05

Significant — Sufficient evidence to reject H₀ at the standard 5% level. The most common threshold in science.

p < 0.10

Marginal — Weak evidence against H₀. Sometimes called “trending toward significance.” Use with caution.

p ≥ 0.10

Not Significant — Insufficient evidence to reject H₀. Does not prove H₀ is true, only that data is consistent with it.

Test Distributions

Z-Test:  p = P(Z ≥ |z|) for standard normal. Used when σ is known or n is large (≥30).
T-Test:  p = P(T ≥ |t|) with df = n − 1. Used when σ is unknown and sample is small.
Chi-Square:  p = P(χ² ≥ x²) with df = k − 1. Used for goodness-of-fit and independence tests.
F-Test:  p = P(F ≥ f) with df₁ and df₂. Used for ANOVA and comparing two variances.
DistributionParametersCommon Use
Z (Normal)None (standard)Large samples, known σ, proportions
t (Student’s)df = n − 1Small samples, unknown σ
χ² (Chi-square)df = k − 1Categorical data, goodness-of-fit
F (Fisher)df₁, df₂ANOVA, variance comparison

One-Tailed vs Two-Tailed Tests

One-Tailed (right) α
Two-Tailed α/2 α/2

One-Tailed (Directional)

Tests for an effect in a specific direction (e.g., mean is greater than or less than a value). All α is concentrated in one tail, giving more power for that direction.

Two-Tailed (Non-directional)

Tests for any difference in either direction. The α is split between both tails (α/2 each). More conservative but appropriate when the direction is not pre-specified.

Important: The choice between one-tailed and two-tailed tests must be made before looking at the data. Choosing after seeing results inflates the false positive rate.

Common Misconceptions About P-Values

❌ “P is the probability H₀ is true”

Incorrect. The p-value is the probability of the data (or more extreme) given H₀ is true. It is P(data | H₀), not P(H₀ | data). Bayesian methods are needed for the latter.

❌ “1 − p = probability H₁ is true”

Incorrect. A p-value of 0.03 does not mean there is a 97% chance the alternative is true. The p-value tells you about the data under H₀, not the probability of any hypothesis.

❌ “Not significant = no effect”

Incorrect. Failure to reject H₀ does not prove H₀. It may simply mean the sample size was too small to detect the effect. “Absence of evidence is not evidence of absence.”

✅ Correct Interpretation

Correct: “Assuming H₀ is true, the probability of observing a test statistic as extreme as or more extreme than the one observed is p.” Always interpret in context.

Frequently Asked Questions

A p-value is the probability of observing results at least as extreme as the data, assuming the null hypothesis is true. A small p-value (e.g., less than 0.05) provides evidence against the null hypothesis. It is not the probability that the null hypothesis is true.
Use a one-tailed test when you have a directional hypothesis (e.g., the mean is greater than a value). Use two-tailed when you test for any difference in either direction. The choice must be made before looking at the data.
Use Z for large samples with known σ. Use t for small samples with unknown σ. Use chi-square for goodness-of-fit and independence tests. Use F for comparing variances or ANOVA.
The most common is α = 0.05 (95% confidence). For high-stakes decisions, use 0.01. For exploratory research, 0.10 is sometimes used. The threshold should be chosen before conducting the test.
No. Statistical significance does not equal practical significance. A large sample can produce a tiny p-value for a trivially small effect. Always report effect sizes alongside p-values to assess real-world importance.
Common mistakes include thinking p is the probability H₀ is true, or that 1 − p is the probability H₁ is true, or that a non-significant result proves no effect exists. The p-value only measures evidence against H₀ under the assumption H₀ is true.

Support This Free Tool

Every coffee helps keep the servers running. Every book sale funds the next tool I'm dreaming up. You're not just supporting a site — you're helping me build what developers actually need.

500K+ users
200+ tools
100% private
Privacy Guarantee: Private keys you enter or generate are never stored on our servers. All tools are served over HTTPS.