Prediction intervals

Confidence intervals

If we used the same sampling method to select many different samples and computed an interval estimate from each one, we would expect about 95% of those intervals to contain the true population parameter ( $β_{1}$ ).
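The repeated-sampling interpretation can be checked with a small simulation. This is only a sketch under assumptions that are not in the slides: the true slope, the sample size, the noise level, and the number of repetitions are all made-up values, and the data are simulated rather than taken from Sparrows.

```r
# Simulate repeated sampling from a population with a known slope and
# count how often the 95% confidence interval for the slope captures it.
set.seed(1)
beta_1 <- 0.5    # true slope (hypothetical value chosen for this sketch)
n      <- 50     # observations per sample (assumption)
n_sims <- 2000   # number of repeated samples (assumption)

covered <- replicate(n_sims, {
  x  <- runif(n, 0, 10)
  y  <- 1 + beta_1 * x + rnorm(n, sd = 2)
  ci <- confint(lm(y ~ x), "x", level = 0.95)  # 1 x 2 matrix: lower, upper
  ci[1, 1] <= beta_1 && beta_1 <= ci[1, 2]
})

# Proportion of intervals that contain the true slope
mean(covered)
```

The proportion should land close to 0.95, which is what "95% confidence" refers to: the long-run behavior of the method, not the probability attached to any single interval.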

Confidence interval for $β_{1}$

How do we calculate the confidence interval for the slope?

${\hat{β}}_{1} \pm t^{*} S E_{{\hat{β}}_{1}}$

by Dr. Lucy D'Agostino McGowan

How do we calculate it in R?

Using the broom package:
lm(Weight ~ WingLength, Sparrows) %>%
  tidy(conf.int = TRUE)

## # A tibble: 2 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)    1.37     0.957       1.43 1.56e- 1   -0.531     3.26 
## 2 WingLength     0.467    0.0347     13.5  2.62e-25    0.399     0.536

"By hand":
t_star <- qt(0.025, df = 116 - 2, lower.tail = FALSE)
# or
t_star <- qt(0.975, df = 116 - 2)
0.467 - t_star * 0.0347

## [1] 0.3982596
0.467 + t_star * 0.0347

## [1] 0.5357404
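
The by-hand limits above differ slightly from the `conf.low` and `conf.high` columns, most likely because they plug in the rounded estimate and standard error. As a rough check that the formula and the software agree when full precision is used, the sketch below repeats the calculation on R's built-in `cars` data. That dataset is a stand-in chosen so the code runs without extra packages; it is not the Sparrows data, and the object names are illustrative.

```r
# Fit a simple linear regression on the built-in `cars` data
# (a stand-in for Sparrows, used only so this sketch is self-contained).
fit <- lm(dist ~ speed, data = cars)

# Pull the unrounded slope estimate and its standard error.
coefs <- summary(fit)$coefficients
b1    <- coefs["speed", "Estimate"]
se_b1 <- coefs["speed", "Std. Error"]

# Critical value with n - 2 degrees of freedom.
t_star <- qt(0.975, df = nrow(cars) - 2)

# Estimate plus or minus t* times the standard error
by_hand  <- c(b1 - t_star * se_b1, b1 + t_star * se_b1)

# The same interval from base R
built_in <- confint(fit, "speed", level = 0.95)

by_hand
built_in
```

With unrounded inputs the two results should match to numerical precision.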

Confidence intervals

There are ✌️ other types of confidence intervals we may want to calculate

The confidence interval for the mean response in $y$ for a given $x^{*}$ value: confidence interval for $μ_{y}$
The confidence interval for an individual response $y$ for a given $x^{*}$ value: prediction interval
Why are these different? Which do you think is easier to estimate?
It is harder to predict one response than to predict a mean response. What does this mean in terms of the standard error?
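The SE for the prediction interval is going to be larger: it has to account for the variability of an individual response around the mean, in addition to the uncertainty in the estimated mean.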
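
To see the difference in practice, here is a minimal sketch using base R's `predict()` with `interval = "confidence"` and `interval = "prediction"`. It again uses the built-in `cars` data as a stand-in for Sparrows, and the value $x^{*} = 15$ is arbitrary. The by-hand standard errors use the standard textbook formulas for simple linear regression, which these slides have not stated.

```r
fit    <- lm(dist ~ speed, data = cars)  # stand-in data, not Sparrows
x_star <- 15                             # hypothetical x* value
new_x  <- data.frame(speed = x_star)

# Interval for the mean response at x*
ci_mean <- predict(fit, newdata = new_x, interval = "confidence", level = 0.95)
# Interval for a single new response at x*
pi_ind  <- predict(fit, newdata = new_x, interval = "prediction", level = 0.95)

# Standard errors by hand (textbook formulas for simple linear regression)
n         <- nrow(cars)
sigma_hat <- summary(fit)$sigma          # residual standard error
x_bar     <- mean(cars$speed)
sxx       <- sum((cars$speed - x_bar)^2)
se_mean   <- sigma_hat * sqrt(1 / n + (x_star - x_bar)^2 / sxx)
se_pred   <- sigma_hat * sqrt(1 + 1 / n + (x_star - x_bar)^2 / sxx)
t_star    <- qt(0.975, df = n - 2)

ci_mean
pi_ind
c(se_mean = se_mean, se_pred = se_pred)
```

Both intervals are centered on the same fitted value, but the prediction interval is wider because its standard error carries the extra "1 +" term for the spread of individual responses.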
