Wayne:
Is there some historical or administrative reason that a particular set of predictor variables
might be included, even if many of them don't have significant coefficients in the current model?
For example, someone might have done some work back in, say, the 70s that produced the current
set of explanatory variables to be included, and as a result they are always included when a new
model is done. (This is also a backhanded way to think you don't need to publish any overall
fit statistics, if you're going to always use the same variables regardless.) The cut-offs you mention
could be historical as well.
Another possible reason to include "extra" variables is to make the variance of the fitted dependent variable
as small as possible (by driving down whatever the moral equivalent of R-squared is),
even if it means that you're including some variables that are correlated with
each other and thus decreasing the reliability of the fitted coefficients. I don't like this tack, since it leads
to black-box models that are over-fitted and data-dependent, but others' mileage may vary, especially
if "we've always included this set of variables in our models." But I think your instincts are right for this situation:
you might be able to do something with the fitted value (E), but you can't rely on "looking under the hood"
at the individual coefficients to help determine a course of treatment. In particular, you can have the nasty
situation that among a set of correlated variables, the real explanatory variable loses out in the
luck-of-the-draw "battle" for significance to one of its proxies (e.g., if smoking incidence is highly correlated
with gender, and smoking is the genuine risk factor, but gender "wins" the battle and thus leads a clinician
into thinking, "well, gender has a marginally significant coefficient and smoking does not, so let's
differentiate treatment by gender").
Hope this helps,
>>Kathy
-------------------------------------------
Katherine Godfrey
-------------------------------------------