Normal Distribution Probability Density Function (PDF)
The probability density function of a normal distribution describes the likelihood of a continuous random variable taking on a specific value based on its mean and variance.
This public page keeps the free explanation visible and leaves premium worked solving, advanced walkthroughs, and saved study tools inside the app.
Core idea
Overview
This formula represents the classic bell-shaped Gaussian curve, where the peak is defined by the mean (μ) and the spread or width is controlled by the variance (σ²). It is the cornerstone of inferential statistics, as the Central Limit Theorem dictates that the sums of many independent random variables tend toward this distribution. The integral of this function over any interval represents the probability that the random variable falls within that range.
When to use: Use this to model physical, biological, or social phenomena where data points cluster around a central average with symmetric deviations.
Why it matters: It allows for the calculation of probabilities, hypothesis testing, and the estimation of parameters in nearly all scientific and engineering fields.
Symbols
Variables
x = Random Variable, = Mean, = Variance
Walkthrough
Derivation
Derivation of Normal Distribution Probability Density Function (PDF)
The normal distribution is derived from the requirement that the maximum likelihood estimator for a mean of independent observations is the arithmetic mean, leading to Gauss's functional equation.
- The probability density function f(x) is dependent only on the distance from the mean.
- The joint probability of independent observations is the product of their individual probabilities.
- The function must be normalized such that the total area under the curve equals 1.
Formulating the Functional Equation
Assuming the most probable value for the mean is the arithmetic mean, the product of the densities must be a function of the sum of squares of the observations.
Note: This is often referred to as Gauss's derivation based on the postulate of the arithmetic mean.
Solving via Logarithmic Differentiation
By taking the natural logarithm of both sides, the product transforms into a sum, which implies that the derivative must be linear, leading to the form f(x) = .
Note: We identify 'a' as negative to ensure the function decays as |x| increases.
Determining Constants
We use the Gaussian integral identity to find the normalization constant C, ensuring the total probability integrates to 1.
Note: Recall that the integral of is the square root of pi.
Final Normalization
Substituting the variance sigma-squared for the spread parameter yields the standard form of the normal PDF.
Note: This final form satisfies the property that the distribution is centered at mu with variance sigma-squared.
Result
Source: Gauss, C. F. (1809). Theoria motus corporum coelestium.
Free formulas
Rearrangements
Solve for
Make x the subject
Isolate the variable x by taking the natural logarithm and performing algebraic operations.
Difficulty: 3/5
Solve for
Make the subject
Solve for the mean by isolating the squared term inside the exponent.
Difficulty: 3/5
Solve for
Make the subject
Solve for the variance by using the Lambert W function or iterative methods as appears in both base and exponent.
Difficulty: 4/5
The static page shows the finished rearrangements. The app keeps the full worked algebra walkthrough.
Why it behaves this way
Intuition
Imagine a physical mountain range created by dropping sand on a flat surface. The peak (the mean) is where most of the sand gathers, and the height drops off exponentially as you move away from the center. The curve is a 'gravity-weighted' shape where the steepness of the slopes is controlled by the spread of the sand; a wide pile (large variance) is gentle, while a tall, skinny spike (small variance) is steep.
Signs and relationships
- -(x - μ)²: The negative sign ensures the exponent is always negative or zero, creating a peak at the mean (where x=μ) and causing the function to decay toward zero as x moves away from the mean.
- 1 / sqrt(2πσ²): This is the 'normalization constant.' It ensures the total area under the entire curve is exactly 1, representing 100% total probability.
One free problem
Practice Problem
For a normal distribution with a mean (μ) of 0 and a variance (σ²) of 1, calculate the density f(x) at x = 0.
Solve for:
Hint: Recall that = 1 and the expression simplifies to 1/sqrt(2π).
The full worked solution stays in the interactive walkthrough.
Where it shows up
Real-World Context
The heights of adult men in a specific population, which cluster around an average height with a predictable standard deviation.
Study smarter
Tips
- Remember that the total area under the curve is always exactly 1.
- Use the standard normal distribution (Z-score) by setting μ=0 and σ=1 to simplify complex calculations.
- Note that approximately 68%, 95%, and 99.7% of data fall within 1, 2, and 3 standard deviations of the mean, respectively.
Avoid these traps
Common Mistakes
- Confusing standard deviation (σ) with variance (σ²).
- Assuming the PDF value is a probability itself, rather than a density (the probability of an exact point is 0).
Common questions
Frequently Asked Questions
The normal distribution is derived from the requirement that the maximum likelihood estimator for a mean of independent observations is the arithmetic mean, leading to Gauss's functional equation.
Use this to model physical, biological, or social phenomena where data points cluster around a central average with symmetric deviations.
It allows for the calculation of probabilities, hypothesis testing, and the estimation of parameters in nearly all scientific and engineering fields.
Confusing standard deviation (σ) with variance (σ²). Assuming the PDF value is a probability itself, rather than a density (the probability of an exact point is 0).
The heights of adult men in a specific population, which cluster around an average height with a predictable standard deviation.
Remember that the total area under the curve is always exactly 1. Use the standard normal distribution (Z-score) by setting μ=0 and σ=1 to simplify complex calculations. Note that approximately 68%, 95%, and 99.7% of data fall within 1, 2, and 3 standard deviations of the mean, respectively.
References
Sources
- Feller, W. (1968). An Introduction to Probability Theory and Its Applications.
- Ross, S. M. (2014). A First Course in Probability.
- Gauss, C. F. (1809). Theoria motus corporum coelestium.