Interval Estimation Fundamentals
TL;DR
Interval estimation provides a range (an interval) within which a population parameter, like the mean or proportion, is likely to fall. This interval consists of a point estimate plus or minus a margin of error, giving you a measure of confidence in your estimate. The specific method used to calculate this interval depends on whether the population standard deviation is known or unknown.
1. The Mental Model
Imagine you're trying to guess someone's age. Instead of saying "they're exactly 30," you might say "they're between 28 and 32." Interval estimation works similarly, giving you a range of plausible values for a population characteristic rather than a single, precise number.
2. The Core Material
When you're trying to estimate a population parameter (like the average income in a city), it's often impractical to measure every single item in the population. Instead, you take a sample and use that sample to create an interval estimate.
The general form of an interval estimate for a population mean ($\mu$) is:
Point Estimate $\pm$ Margin of Error
Here, the point estimate is usually your sample mean ($\bar{x}$).
Understanding the Margin of Error
The margin of error quantifies the precision of your estimate. A smaller margin of error means your interval is narrower and more precise. It's calculated based on your sample data, the desired confidence level, and the variability of the population.
Interval Estimate of a Population Mean: $\sigma$ Known
If you know the population standard deviation ($\sigma$), you'll use the Z-distribution (standard normal distribution) to construct your interval.
The formula for the interval estimate of $\mu$ when $\sigma$ is known is:
$\bar{x} \pm z_{\alpha/2} \frac{\sigma}{\sqrt{n}}$
Where:
* $\bar{x}$ is the sample mean.
* $z_{\alpha/2}$ is the critical Z-value corresponding to your desired confidence level.
* $\alpha$ represents the probability of your interval not containing the true population mean.
* $\alpha/2$ is the area in each tail of the Z-distribution.
* $\sigma$ is the population standard deviation.
* $n$ is the sample size.
Confidence Level Explained:
If you say you're 90% confident, it means that if you were to construct many such intervals, 90% of them would contain the true population mean. The remaining 10% (the $\alpha$) would not.
Common $z_{\alpha/2}$ Values:
| Confidence Level | $