PsychologyResearch Design, Power AnalysisUniversity
AQAAPOntarioNSWCBSEGCE O-LevelMoECAPS

Sample Size for Estimating a Mean

Calculates the minimum sample size required to estimate a population mean with a specified confidence level and margin of error.

Understand the formulaSee the free derivationOpen the full walkthrough

This public page keeps the free explanation visible and leaves premium worked solving, advanced walkthroughs, and saved study tools inside the app.

Core idea

Overview

This formula is crucial in research design for determining the optimal number of participants or observations needed to achieve a desired level of precision and confidence in estimating a population mean. It balances the need for statistical power with practical resource constraints, ensuring that studies are adequately powered to detect meaningful effects without being unnecessarily large. The formula incorporates the desired confidence level (via the z-score), the variability of the population (standard deviation), and the acceptable margin of error.

When to use: Use this equation during the planning phase of a study when you need to determine how many participants or data points are required to estimate a population mean within a certain margin of error and with a specific confidence level. It's essential before data collection begins to ensure the study is adequately powered.

Why it matters: Proper sample size determination is fundamental to ethical and efficient research. Too small a sample may lead to inconclusive results or a failure to detect true effects (Type II error), wasting resources. Too large a sample can be unnecessarily costly, time-consuming, and potentially unethical if participants are exposed to risks without added benefit. This formula ensures statistical validity and resource optimization.

Symbols

Variables

n = Sample Size, z = Z-score, s = Standard Deviation, E = Margin of Error

Sample Size
count
Z-score
dimensionless
Standard Deviation
units
Margin of Error
units

Walkthrough

Derivation

Formula: Sample Size for Estimating a Mean

This formula determines the minimum sample size needed to estimate a population mean with a specified confidence level and margin of error.

  • The population standard deviation (s) is known or can be reliably estimated.
  • The sampling distribution of the mean is approximately normal, which is generally true for large sample sizes (Central Limit Theorem) or normally distributed populations.
  • The sample is randomly selected from the population.
1

Start with the Margin of Error formula:

The margin of error (E) for a population mean is defined as the product of the critical z-score (z) for the desired confidence level and the standard error of the mean (s/√n).

2

Isolate the square root of n:

To isolate 'n', first multiply both sides by √n and divide by E, effectively swapping them.

3

Square both sides:

Square both sides of the equation to remove the square root from 'n', yielding the formula for the required sample size.

Note: Always round the final calculated 'n' up to the next whole number.

Result

Source: Gravetter, F. J., & Wallnau, L. B. (2017). Statistics for the Behavioral Sciences (10th ed.). Cengage Learning. Chapter 8: Introduction to Hypothesis Testing.

Free formulas

Rearrangements

Solve for

Make n the subject

n is already the subject of the formula.

Difficulty: 1/5

Solve for

Sample Size for Estimating a Mean: Make z the subject

To make (Z-score) the subject, rearrange the formula by isolating on one side of the equation.

Difficulty: 2/5

Solve for

Sample Size for Estimating a Mean: Make s the subject

To make (Standard Deviation) the subject, rearrange the formula by isolating on one side of the equation.

Difficulty: 2/5

Solve for

Sample Size for Estimating a Mean: Make E the subject

To make (Margin of Error) the subject, rearrange the formula by isolating on one side of the equation.

Difficulty: 2/5

The static page shows the finished rearrangements. The app keeps the full worked algebra walkthrough.

Visual intuition

Graph

The graph displays a steep decay pattern where the sample size n decreases as the margin of error E increases, with the curve approaching the axes as asymptotes for all values greater than zero. For a psychology student, this means that demanding a very small margin of error requires a rapidly increasing number of participants, while accepting a larger margin of error allows for a much smaller sample size. The most critical feature of this curve is that the sample size never reaches zero, illustrating that even with a very large margin of error, some data collection is always necessary to estimate a population mean.

Graph type: inverse

Why it behaves this way

Intuition

Imagine trying to pinpoint the center of a target (the population mean) with a certain level of accuracy (margin of error 'E'). The variability of your individual measurements ('s')

The required number of observations or participants in a study.
A larger sample size 'n' generally leads to more precise and reliable estimates of the population mean, reducing the impact of random sampling variation.
The critical value from the standard normal distribution corresponding to the desired confidence level.
A higher desired confidence level (e.g., 99% vs. 95%) requires a larger 'z' value, which in turn demands a larger sample size to achieve the same margin of error, as you are aiming for greater certainty.
An estimate of the population standard deviation, representing the inherent variability or spread of data points within the population.
If the data is highly variable (large 's'), more observations are needed to accurately represent the population and estimate its mean, because individual data points are less representative of the whole.
The maximum allowable difference between the sample mean and the true population mean; it defines the desired precision of the estimate.
A smaller, more stringent margin of error 'E' (i.e., desiring a more precise estimate) requires a significantly larger sample size, as it's harder to narrow down the estimate to a very small range.

Signs and relationships

  • (z · s): This product represents the 'total spread' or variability that needs to be accounted for in the sampling distribution of the mean, adjusted for the desired confidence.
  • /E: The margin of error 'E' is in the denominator, indicating an inverse relationship. As 'E' decreases (demanding more precision), the ratio increases, requiring a larger sample size.
  • ()^2: The entire ratio is squared, meaning that the sample size 'n' increases quadratically with the desired confidence (z) and population variability (s), and decreases quadratically with the desired margin of error (E).

Free study cues

Insight

Canonical usage

This equation calculates a dimensionless count (sample size). The standard deviation (s) and margin of error (E) must be in consistent units, making their ratio dimensionless.

Common confusion

A common mistake is using inconsistent units for 's' (standard deviation) and 'E' (margin of error), which would lead to an incorrect dimensionless ratio and thus an incorrect sample size.

Dimension note

The sample size (n) is a count and therefore dimensionless. The terms 's' and 'E' must share the same units, ensuring their ratio is dimensionless, which is then squared to yield a dimensionless sample size.

Unit systems

dimensionless (count) · Represents the number of observations or participants, always an integer.
dimensionless · The z-score corresponding to the desired confidence level (e.g., 1.96 for 95% confidence). It is a pure number from the standard normal distribution.
units of the measured variable · The estimated population standard deviation. Its units will match the units of the variable being measured (e.g., 'score', 'seconds', 'USD').
units of the measured variable · The desired margin of error. Its units must be the same as the standard deviation (s) and the measured variable.

One free problem

Practice Problem

A psychologist wants to estimate the average IQ score of a specific population. They desire a 95% confidence level and a margin of error of 3 IQ points. Based on previous research, the population standard deviation is estimated to be 15 IQ points. What is the minimum sample size required for this study?

Z-score1.96 dimensionless
Standard Deviation15 units
Margin of Error3 units

Solve for:

Hint: Remember to round up the final sample size.

The full worked solution stays in the interactive walkthrough.

Where it shows up

Real-World Context

Determining the number of patients needed for a clinical trial to estimate the average reduction in blood pressure from a new drug.

Study smarter

Tips

  • Always use the appropriate z-score for your desired confidence level (e.g., 1.96 for 95%, 2.58 for 99%).
  • An estimate of the population standard deviation (s) is often obtained from pilot studies or previous research.
  • A smaller margin of error (E) or higher confidence level will require a larger sample size.
  • The calculated sample size 'n' should always be rounded up to the next whole number, as you cannot have a fraction of a participant.

Avoid these traps

Common Mistakes

  • Using an incorrect z-score for the desired confidence level.
  • Failing to round the final sample size 'n' up to the nearest whole number.
  • Underestimating the population standard deviation, leading to an underpowered study.

Common questions

Frequently Asked Questions

This formula determines the minimum sample size needed to estimate a population mean with a specified confidence level and margin of error.

Use this equation during the planning phase of a study when you need to determine how many participants or data points are required to estimate a population mean within a certain margin of error and with a specific confidence level. It's essential before data collection begins to ensure the study is adequately powered.

Proper sample size determination is fundamental to ethical and efficient research. Too small a sample may lead to inconclusive results or a failure to detect true effects (Type II error), wasting resources. Too large a sample can be unnecessarily costly, time-consuming, and potentially unethical if participants are exposed to risks without added benefit. This formula ensures statistical validity and resource optimization.

Using an incorrect z-score for the desired confidence level. Failing to round the final sample size 'n' up to the nearest whole number. Underestimating the population standard deviation, leading to an underpowered study.

Determining the number of patients needed for a clinical trial to estimate the average reduction in blood pressure from a new drug.

Always use the appropriate z-score for your desired confidence level (e.g., 1.96 for 95%, 2.58 for 99%). An estimate of the population standard deviation (s) is often obtained from pilot studies or previous research. A smaller margin of error (E) or higher confidence level will require a larger sample size. The calculated sample size 'n' should always be rounded up to the next whole number, as you cannot have a fraction of a participant.

References

Sources

  1. Andy Field, Discovering Statistics Using IBM SPSS Statistics
  2. Wikipedia: Sample size determination
  3. Discovering Statistics Using IBM SPSS Statistics, Andy Field
  4. Statistical Methods for Psychology, David C. Howell
  5. Gravetter, F. J., & Wallnau, L. B. (2017). Statistics for the Behavioral Sciences (10th ed.). Cengage Learning.
  6. Field, A. (2018). Discovering Statistics Using IBM SPSS Statistics (5th ed.). SAGE Publications.
  7. OpenStax. (2021). Introductory Statistics. Rice University.
  8. Gravetter, F. J., & Wallnau, L. B. (2017). Statistics for the Behavioral Sciences (10th ed.). Cengage Learning. Chapter 8: Introduction to Hypothesis Testing.