When we say we have a 95% confidence interval (CI) for a population parameter (like the mean), this interval is based on the sample data we have collected. The CI gives us a range within which we are “95% confident” that the true population parameter lies, based on our sample. This confidence level is determined before we even collect our sample data, and it’s about the process used to estimate the interval, not the interval itself.
The correct interpretation of a 95% CI is this: If we were to take many samples from the population and calculate a 95% CI from each sample, we would expect 95% of those intervals to contain the true population parameter. It’s about the long-run frequency of these intervals capturing the true parameter if we repeated our sampling process many times under the same conditions.
However, for a specific interval we calculate from a single sample, it’s not accurate to say there’s a 95% probability that this particular interval contains the true population parameter. Once the interval is calculated, the true parameter is either within it or not; there’s no probability about it. This is because the true parameter is a fixed value, not a random variable, and the probability concept doesn’t apply in the way we might intuitively think.
The misconception often comes from interpreting the confidence interval in a Bayesian manner without intending to. In a Bayesian framework, it would make sense to talk about the probability of a parameter being within a certain range given the data (because Bayesian statistics allow for probability statements about parameters themselves). However, in the frequentist framework, which confidence intervals belong to, probabilities are related to the process of sampling and not to the parameters themselves.
In summary, the 95% confidence level refers to the procedure of how often the intervals, when constructed using the same method across numerous samples, would contain the true parameter, not to the probability of a single, specific interval capturing this parameter upon its computation.