class: center, middle, inverse, title-slide

# Multiple Logistic Regression

---
layout: true

<div class="my-footer">
<span>
by Dr. Lucy D'Agostino McGowan
</span>
</div>

---

## Types of statistical models

response | predictor(s) | model
---------|--------------|------
quantitative | one quantitative | simple linear regression
quantitative | two or more (of either kind) | multiple linear regression
binary | one (of either kind) | simple logistic regression
binary | two or more (of either kind) | multiple logistic regression

---

## Types of statistical models

response | predictor(s) | model
---------|--------------|------
quantitative | one quantitative | simple linear regression
quantitative | two or more (of either kind) | multiple linear regression
binary | one (of either kind) | simple logistic regression
**binary** | **two or more (of either kind)** | **multiple logistic regression**

---

## Types of statistical models

variables | predictor | ordinary regression | logistic regression
---|---|---|---
one: `\(x\)` | `\(\beta_0 + \beta_1 x\)` | Response `\(y\)` | `\(\textrm{logit}(\pi)=\log\left(\frac{\pi}{1-\pi}\right)\)`
several: `\(x_1,x_2,\dots,x_k\)` | `\(\beta_0 + \beta_1x_1 + \dots+\beta_kx_k\)` | Response `\(y\)` | `\(\textrm{logit}(\pi)=\log\left(\frac{\pi}{1-\pi}\right)\)`

---

## Multiple logistic regression

* ✌️ forms

Form | Model
-----|------
Logit form | `\(\log\left(\frac{\pi}{1-\pi}\right) = \beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \beta_kx_k\)`
Probability form | `\(\Large\pi = \frac{e^{\beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \beta_kx_k}}{1+e^{\beta_0 + \beta_1x_1 + \beta_2x_2 + \dots + \beta_kx_k}}\)`

---

# Steps for modeling

![](img/03/flowchart-arrow.png)

---

## Fit

.small[

```r
data(MedGPA)
glm(Acceptance ~ MCAT + GPA, data = MedGPA, family = "binomial") %>%
  tidy(conf.int = TRUE)
```

```
## # A tibble: 3 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)  -22.4       6.45      -3.47 0.000527  -36.9     -11.2
## 2 MCAT           0.165     0.103      1.59 0.111      -0.0260    0.383
## 3 GPA            4.68      1.64       2.85 0.00439     1.74      8.27
```

]

---

## Fit

.question[
What does this do?
]

.small[

```r
*glm(Acceptance ~ MCAT + GPA, data = MedGPA, family = "binomial") %>%
  tidy(conf.int = TRUE)
```

```
## # A tibble: 3 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)  -22.4       6.45      -3.47 0.000527  -36.9     -11.2
## 2 MCAT           0.165     0.103      1.59 0.111      -0.0260    0.383
## 3 GPA            4.68      1.64       2.85 0.00439     1.74      8.27
```

]

---

## Fit

.question[
What does this do?
]

.small[

```r
glm(Acceptance ~ MCAT + GPA, data = MedGPA, family = "binomial") %>%
* tidy(conf.int = TRUE)
```

```
## # A tibble: 3 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)  -22.4       6.45      -3.47 0.000527  -36.9     -11.2
## 2 MCAT           0.165     0.103      1.59 0.111      -0.0260    0.383
## 3 GPA            4.68      1.64       2.85 0.00439     1.74      8.27
```

]

---

## Fit

.question[
What does this do?
]

.small[

```r
glm(Acceptance ~ MCAT + GPA, data = MedGPA, family = "binomial") %>%
  tidy(conf.int = TRUE) %>%
* kable()
```

]

.small[

<table>
 <thead>
  <tr>
   <th style="text-align:left;"> term </th>
   <th style="text-align:right;"> estimate </th>
   <th style="text-align:right;"> std.error </th>
   <th style="text-align:right;"> statistic </th>
   <th style="text-align:right;"> p.value </th>
   <th style="text-align:right;"> conf.low </th>
   <th style="text-align:right;"> conf.high </th>
  </tr>
 </thead>
 <tbody>
  <tr>
   <td style="text-align:left;"> (Intercept) </td>
   <td style="text-align:right;"> -22.373 </td>
   <td style="text-align:right;"> 6.454 </td>
   <td style="text-align:right;"> -3.47 </td>
   <td style="text-align:right;"> 0.001 </td>
   <td style="text-align:right;"> -36.894 </td>
   <td style="text-align:right;"> -11.235 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> MCAT </td>
   <td style="text-align:right;"> 0.165 </td>
   <td style="text-align:right;"> 0.103 </td>
   <td style="text-align:right;"> 1.59 </td>
   <td style="text-align:right;"> 0.111 </td>
   <td style="text-align:right;"> -0.026 </td>
   <td style="text-align:right;"> 0.383 </td>
  </tr>
  <tr>
   <td style="text-align:left;"> GPA </td>
   <td style="text-align:right;"> 4.676 </td>
   <td style="text-align:right;"> 1.642 </td>
   <td style="text-align:right;"> 2.85 </td>
   <td style="text-align:right;"> 0.004 </td>
   <td style="text-align:right;"> 1.739 </td>
   <td style="text-align:right;"> 8.272 </td>
  </tr>
 </tbody>
</table>

]

---

## Assess

.question[
What are the assumptions of multiple logistic regression?
]

--

* Linearity
* Independence
* Randomness

---

## Assess

.question[
How do you determine whether the conditions are met?
]

* Linearity
* Independence
* Randomness

---

## Assess

.question[
How do you determine whether the conditions are met?
]

* Linearity: empirical logit plots
* Independence: look at the data generation process
* Randomness: look at the data generation process (does the spinner model make sense?)
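
---

## Assess

One way to build the empirical logit plot used to check linearity, sketched for GPA (the choice of 5 bins is arbitrary, and bins are formed with `dplyr::ntile()`):

.small[

```r
library(Stat2Data)
library(dplyr)
library(ggplot2)

data(MedGPA)

MedGPA %>%
  mutate(bin = ntile(GPA, 5)) %>%          # split GPA into 5 roughly equal groups
  group_by(bin) %>%
  summarise(gpa = mean(GPA),
            p   = mean(Acceptance)) %>%    # observed acceptance proportion per bin
  mutate(emp_logit = log(p / (1 - p))) %>% # empirical logit; a bin with p = 0 or 1
                                           # would need an adjustment such as
                                           # (successes + 0.5) / (n + 1)
  ggplot(aes(x = gpa, y = emp_logit)) +
  geom_point() +
  geom_smooth(method = "lm", se = FALSE)
```

]

If the points fall roughly along the line, the linearity condition is reasonable.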
---

## Assess

.question[
If I have two nested models, how do you think I can determine if the full model is significantly better than the reduced?
]

--

* We can compare values of `\(-2 \log(\mathcal{L})\)` (deviance) between the two models

--

* Calculate the "drop in deviance": the difference `\((-2 \log(\mathcal{L}_{reduced})) - ( -2 \log(\mathcal{L}_{full}))\)`

--

* This is a "likelihood ratio test"

--

* This statistic is `\(\chi^2\)` distributed with `\(p\)` degrees of freedom

--

* `\(p\)` is the difference in the number of predictors between the full and reduced models

---

## Assess

* We want to compare a model with GPA and MCAT to one with only GPA

.small[

```r
glm(Acceptance ~ GPA, data = MedGPA, family = binomial) %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -28.4  60.8  64.9     56.8          53
```

]

.small[

```r
glm(Acceptance ~ GPA + MCAT, data = MedGPA, family = binomial) %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -27.0  60.0  66.0     54.0          52
```

]

.small[

```r
56.83901 - 54.01419
```

```
## [1] 2.82
```

]

---

## Assess

* We want to compare a model with GPA and MCAT to one with only GPA

.small[

```r
glm(Acceptance ~ GPA, data = MedGPA, family = binomial) %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -28.4  60.8  64.9     56.8          53
```

]

.small[

```r
glm(Acceptance ~ GPA + MCAT, data = MedGPA, family = binomial) %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -27.0  60.0  66.0     54.0          52
```

]

.small[

```r
pchisq(56.83901 - 54.01419, df = 1, lower.tail = FALSE)
```

```
## [1] 0.0928
```

]

---

## Assess

* We want to compare a model with GPA, MCAT, and number of applications to one with only GPA

.small[

```r
glm(Acceptance ~ GPA, data = MedGPA, family = "binomial") %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -28.4  60.8  64.9     56.8          53
```

]

.small[

```r
glm(Acceptance ~ GPA + MCAT + Apps, data = MedGPA, family = "binomial") %>%
  glance()
```

```
## # A tibble: 1 x 7
##   null.deviance df.null logLik   AIC   BIC deviance df.residual
##           <dbl>   <int>  <dbl> <dbl> <dbl>    <dbl>       <int>
## 1          75.8      54  -26.8  61.7  69.7     53.7          51
```

]

.small[

```r
pchisq(56.83901 - 53.68239, df = 2, lower.tail = FALSE)
```

```
## [1] 0.206
```

]

---

## Use

* We can calculate confidence intervals for the `\(\beta\)` coefficients: `\(\hat\beta\pm z^*\times SE_{\hat\beta}\)`

* To determine whether individual explanatory variables are statistically significant, we can calculate p-values based on the `\(z\)`-statistic of the `\(\beta\)` coefficients (using the normal distribution)

---

## Use

.question[
How do you interpret these `\(\beta\)` coefficients?
]

.small[

```r
glm(Acceptance ~ MCAT + GPA, data = MedGPA, family = "binomial") %>%
  tidy(conf.int = TRUE)
```

```
## # A tibble: 3 x 7
##   term        estimate std.error statistic  p.value conf.low conf.high
##   <chr>          <dbl>     <dbl>     <dbl>    <dbl>    <dbl>     <dbl>
## 1 (Intercept)  -22.4       6.45      -3.47 0.000527  -36.9     -11.2
## 2 MCAT           0.165     0.103      1.59 0.111      -0.0260    0.383
## 3 GPA            4.68      1.64       2.85 0.00439     1.74      8.27
```

]

---

class: middle

## `\(\hat\beta\)` interpretation in multiple logistic regression

The coefficient for `\(x\)` is `\(\hat\beta\)` (95% CI: `\(LB_\hat\beta, UB_\hat\beta\)`). A one-unit increase in `\(x\)` yields an expected change of `\(\hat\beta\)` in the log odds of `\(y\)`, **holding all other variables constant**.

---

class: middle

## `\(e^{\hat\beta}\)` interpretation in multiple logistic regression

The odds ratio for `\(x\)` is `\(e^{\hat\beta}\)` (95% CI: `\(e^{LB_\hat\beta}, e^{UB_\hat\beta}\)`).
A one-unit increase in `\(x\)` yields an expected `\(e^{\hat\beta}\)`-fold change in the odds of `\(y\)`, **holding all other variables constant**.

---

## Summary

 | Ordinary regression | Logistic regression
---|------|----
test or interval for `\(\beta\)` | `\(t = \frac{\hat\beta}{SE_{\hat\beta}}\)` | `\(z = \frac{\hat\beta}{SE_{\hat\beta}}\)`
 | t-distribution | z-distribution
test for nested models | `\(F = \frac{\Delta SSModel / p}{SSE_{full} / (n - k - 1)}\)` | `\(G = \Delta(-2\log\mathcal{L})\)`
 | F-distribution | `\(\chi^2\)`-distribution
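
---

## Summary

The nested-model `\(G\)` test computed by hand earlier can also be run in a single call with base R's `anova()` (a sketch, using the same two MedGPA models as before):

.small[

```r
library(Stat2Data)

data(MedGPA)

reduced <- glm(Acceptance ~ GPA,        data = MedGPA, family = binomial)
full    <- glm(Acceptance ~ GPA + MCAT, data = MedGPA, family = binomial)

# test = "Chisq" performs the likelihood ratio test: it reports the drop in
# deviance (2.82) and the chi-square p-value (0.093) computed by hand above
anova(reduced, full, test = "Chisq")
```

]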