The classical linear regression model for
observations and
regressors
has the form
,
, where
is a
vector of unknown coefficients, the data
and
are observed, and the
unobserved disturbance
has conditional mean
and variance
. We seek parameter estimates
that converge in probability to
as
gets large: we want
consistent estimates.27 Least squares (LS) chooses
to minimize the sum of squared residuals
over all
:
For the regression model the maximum possible breakdown point is, asymptotically, 0.5. One popular estimator that achieves that maximum is least median of squares (LMS):
Other estimators exist that achieve the maximum breakdown point while having a
convergence rate and better Gaussian efficiency than LMS does. The
LQD estimator (Croux et al., 1994) is defined by choosing
to minimize the (approximately) first quartile of the absolute
differences between pairs of residuals. Let
The LQD objective function is difficult to optimize. Because high breakdown point estimators are not smooth functions of the data, optimization techniques that are based solely on derivative information, such as Newton-Raphson, are highly unreliable (Stromberg, 1993). In general, high breakdown point objective functions have multiple minima. Therefore, the use of local optimization techniques is not reliable. But in our application, there does appear to be local hill-climbing information contained in the derivatives. Therefore, we use a global optimizer which makes use of derivative information: GENetic Optimization Using Derivatives (GENOUD) (Sekhon and Mebane, 1998). GENOUD combines evolutionary algorithm methods with a derivative-based, quasi-Newton method to solve difficult unconstrained optimization problems.30