Friday, August 16, 2019

Confidence Intervals

The confidence intervals represent upper and lower bounds of variation around each reference forecast. Values may occur outside the confidence intervals due to external shocks, such as extreme weather, structural changes to the economic system, geopolitical events, or technology development. The confidence intervals increase in width throughout the forecast period due to the increasing level of uncertainty in each subsequent year.The upper and lower bounds were based on one to two standard deviations of the historic values, indicating at least a 68 percent probability that future values would be expected to fall within the confidence interval. The confidence interval for the first forecast year is based on one standard deviation and grows linearly until it reaches two standard deviations, or a 95 percent probability. =====================================================================================For example, if we have polled a number of respondents from the home owners let†™s say 3500 respondents, and from those only 1190 are using electricity to heat their homes, this means that 34. 0% are using electricity to heat their homes, p? = 1190/3500 = 34. 0%. And we know that a second sample of 3500 home owners wouldn’t have a sample proportion of exactly 34. 0%. If another group of home owners has taken and we found that they have a sample of proportion of 38. 0%, So the sampling proportion will be the key to our ability to generalize from our sample to the population.Now, we know that the sampling distribution model is centered at the true proportion, p, of all home owners who use electricity to heat their homes. But we don’t know p. it isn’t 34. 0%. That’s the p? from our sample. What we do know is that the sampling distribution model of p? centered at p, and we know that the standard deviation of the sampling distribution is SE(p? ) = v p? q? /n = v(. 34)(1-. 34)/3500 = 0. 008 Because our sample (3500) is large, we know tha t the sampling distribution model for p? should look approximately like the one shown below The sampling distribution model for p? s normal with a mean of p and a standard deviation we estimate to be v p? q? /n. because the distribution is normal, we’d expect that about 68% of all samples of 3500 home owners taken in a specific time would have had sample proportion within 1 standard deviation of p. and about 95% of all these samples will have proportions within p  ± 2 SEs. But where is our sample proportion in this picture? And what value does p have? We still don’t know! We do know that for 95% of random samples, p? will be no more than 2 SEs away from p. so let’s reverse it and look at it from p? ’s point of view. If I am p? there is a 95% chance that p is no more than 2 SEs away from me. If I reach out 2 SEs, or 2 x 0. 008, away from me on both sides, I am 95% sure that p will be within my grasp. Of course, I won’t know, and even if my interva l does catch p, I still don’t know its true value. Now, We don’t know exactly what proportion of home owners using electricity to heat their homes, but we know that it’s within the interval 34. 0%  ± 2 x 0. 8%. That is, it’s between 32. 2% and 35. 6%. † this is getting closer, but we still can’t be certain. We can’t know for sure that the true proportion is in this interval-or in any particular range.We don’t know the exactly the proportion of home owners that use electricity to heat their homes, but the interval from 32. 4% to 35. 6% probably contains the true proportion. † We’ve now fudged twice—first by giving an interval and second by admitting that we only think the interval â€Å"probably† contains the true value. That last statement is true; we can tighten it up by quantifying what we mean by â€Å"probably. † We saw that 95% of the time when we reach out 2 SEs from p? , we capture p, s o we can be 95% confident that this is one of those times, after putting a number on the probability that this interval covers the true proportion,We are 95% confident that between 32. 4% and 35. 6% of home owners using electricity to heat their homes, this is now an appropriate interpretation of our confidence intervals. It’s not perfect, but it’s about the best we can do. Level of Confidence = 95% =1- ? = 0. 05 Sample size (n) = 3500 Sample proportion (p? ) = 0. 34 P complement (q? ) = (1-0. 34) = 0. 66 Sample standard error of a proportion = SE(p? ) = v p? q? /n = v(. 34)(1-. 34)/3500 = 0. 008 Z Score(1 – ? ) = Z Score95% = NORMSINV (1 – ? /2) = NORMSINV (1 – 0. 025) = NORMSINV (0. 975) = 1. 96 Width of Half the Confidence Interval = Z Score(1-? *  S(q? ) = 1. 96 *  0. 008 = 0. 01568 Confidence Interval Boundaries = p? +/- Z Score(1-? )*S(p? )= 0. 34  ± (1. 96) * (0. 008) = 0. 34  ± (0. 01568) = 0. 32432 to 0. 35568 = 32. 43% to 35. 57% We can be 95% certain that the percentage of home owners using electricity to heat their homes is between 32. 43% and 35. 57% 0. 01568 0. 01568 0. 316 0. 3243 0. 332 0. 34 0 . 348 0. 3557 0. 364 0. 316 0. 3243 0. 332 0. 34 0 . 348 0. 3557 0. 364

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.