In statistics, a linear probability model (LPM) is a special case of a binary regression model. Here the dependent variable for each observation takes values which are either 0 or 1. The probability of observing a 0 or 1 in any one case is treated as depending on one or more explanatory variables. For the "linear probability model", this relationship is a particularly simple one, and allows the model to be fitted by linear regression.
The model assumes that, for a binary outcome (Bernoulli trial), <math>Y</math>, and its associated vector of explanatory variables, <math>X</math>,
: <math> \Pr(Y=1 | X=x) = x'\beta . </math>
For this model,
:<math> E[Y|X] = 0\cdot \Pr(Y=0|X) +1\cdot \Pr(Y=1|X) = \Pr(Y=1|X) =x'\beta,</math>
and hence the vector of parameters β can be estimated using least squares. This method of fitting would be inefficient,), as follows: assume the following regression model with a latent (unobservable) dependent variable:
: <math>y^* = b_0+ \mathbf x'\mathbf b + \varepsilon,\;\; \varepsilon\mid \mathbf x\sim U(-a,a).</math>
The critical assumption here is that the error term of this regression is a symmetric around zero uniform random variable, and hence, of mean zero. The cumulative distribution function of <math>\varepsilon</math> here is <math>F_{\varepsilon|\mathbf x}(\varepsilon\mid \mathbf x) = \frac {\varepsilon + a}{2a}.</math>
Define the indicator variable <math> y = 1</math> if <math> y^* >0</math>, and zero otherwise, and consider the conditional probability
:<math>{\rm Pr}(y =1\mid \mathbf x ) = {\rm Pr}(y^* > 0\mid \mathbf x) = {\rm Pr}(b_0+ \mathbf x'\mathbf b + \varepsilon>0\mid \mathbf x) </math>
:<math> = {\rm Pr}(\varepsilon >- b_0- \mathbf x'\mathbf b\mid \mathbf x) = 1- {\rm Pr}(\varepsilon \leq - b_0- \mathbf x'\mathbf b\mid \mathbf x)</math>
:<math>=1- F_{\varepsilon|\mathbf x}(- b_0- \mathbf x'\mathbf b\mid \mathbf x) =1- \frac {- b_0- \mathbf x'\mathbf b + a}{2a} = \frac {b_0+a}{2a}+\frac {\mathbf x'\mathbf b}{2a}.</math>
But this is the Linear Probability Model,
:<math>P(y =1\mid \mathbf x )= \beta_0 + \mathbf x'\beta</math>
with the mapping
:<math>\beta_0 = \frac {b_0+a}{2a},\;\; \beta=\frac{\mathbf b}{2a}.</math>
This method is a general device to obtain a conditional probability model of a binary variable: if we assume that the distribution of the error term is logistic, we obtain the logit model, while if we assume that it is the normal, we obtain the probit model and, if we assume that it is the logarithm of a Weibull distribution, the complementary log-log model.
See also
- Linear approximation
References
Further reading
- Horrace, William C., and Ronald L. Oaxaca. "Results on the Bias and Inconsistency of Ordinary Least Squares for the Linear Probability Model." Economics Letters, 2006: Vol. 90, P. 321–327
