Our model is a generalized linear model (GLM) (McCullagh and Nelder, 1989) of
the binomial family with a logistic link, allowing for overdispersion. The
dependent variable is the proportion of the presidential vote in reporting unit
that was cast for Buchanan, denoted
, out of the
total number of votes cast for Browne, Buchanan, Bush, Gore, Hagelin, Nader, Phillips (when the candidate appears on the ballot). Let
denote the number of votes for Buchanan and let
denote the total number of votes cast for either Buchanan, Bush, Gore or Nader
in reporting unit
. The proportion we study is
. We base the GLM's linear predictor, denoted
, on the proportions of the vote in reporting unit
that were cast
for Bush and for Nader, denoted respectively
and
. The linear predictor is defined as
We are interested in the discrepancy between the actual number of votes for
Buchanan in reporting unit
(
) and the predicted
number of votes, denoted
. The simplest measure of that discrepancy is the
simple residual defined by
A problem with the simple residuals is that, in a sense, the size of
residual that we should expect to occur depends on the size of the
support for Buchanan that the model predicts. As the size of the
expected proportion
increases from zero
toward 0.5, the chances of observing a larger residual increases.
This may be a real problem where the main question is whether support
for Buchanan in a particular reporting unit is excessively large. The
residual for a reporting unit may be large relative to the residuals
for other reporting units merely because the expected support for
Buchanan is truly larger among the voters in that reporting unit. If
one determines whether Buchanan vote in an area is excessively large
by using a test based on simple residuals, the resulting test results
will be biased in the sense of tending to find such excesses when they
do not really exist.20
It is important to understand how this phenomenon occurs. The reason one
expects to see larger residuals when the baseline support for Buchanan is truly
bigger is that as the baseline proportion of votes for Buchanan increases from
zero up to 0.5, the variance of the actual proportion of votes around the
baseline expected value increases. This means that for any particular
``large'' size for a possible residual that one might specify (within the range
zero to
), the chances of seeing a residual as large as that size
increase as the baseline proportion increases. If
is the baseline expected value and one analyzes
the vote for Buchanan while treating the total number of votes
as a fixed
quantity (known as conditioning on the total), then the variance of
is
To make the discrepancies from different reporting units comparable to one
another it is necessary to eliminate the variations that stem from the
heteroscedasticity (differing variances) among the observed votes for Buchanan.
The way to do that is to divide each simple residual by the square root of the
variance
var
. In this way we compute what's
known as the studentized residual,
:
For each state we estimate a separate set of parameter values
,
and
of equation (1) and
overdispersion value
. The studentized residuals are comparable
across the reporting units from each state and also across states.
To implement a more powerful assessment of the discrepancy for each reporting
unit, we use a jackknife method: the parameter values used to compute the
residual for reporting unit
are estimated using the data from all the
reporting units in the same state as
but omitting the data for
. The
histogram in Figure 1 shows the jackknife studentized residuals
from counties in Florida. The histogram in Figure 2 pools such
residuals from all 46 states for which the model of equation
(1) could be estimated.