This message has been cross-posted to the following eGroups: Young Professionals Group and Statistical Consulting Section.
-------------------------------------------
Hello all:
I have thought of the following model specification test. Most likely a similar idea has been developed before, and I would appreciate any references you can point me to.
The setup is as follows. There is a GLM
E[Y] = g(X'beta)
There is a large number of experimental units. Each unit has its own response vector, but the number of observations and the design are the same for all units. The model is fitted separately for each unit.
There are no units for which the true beta or the true response is known. Therefore, to test for possible model misspecification, I introduce a binary factor, X_random, whose levels are assigned at random so that it is not associated with Y, and include it in the model:
E[Y] = g( (X | X_random)' beta )
If the model is well specified, the Type III p-value for X_random should be distributed (at least approximately) as U(0, 1). I can therefore collect the p-values for X_random across all of the units and test them for uniformity.
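To make the procedure concrete, here is a minimal simulation sketch in Python (numpy and scipy only). Everything in it is an assumption made for illustration and not part of my actual analysis: the GLM is taken to be Poisson with a log link, a Wald z-test stands in for the Type III test (X_random has one degree of freedom), a Kolmogorov-Smirnov test is used for uniformity, and the function names fit_poisson and simulate_pvalues are made up. The misspecified scenario uses overdispersed counts, which is only one possible kind of misspecification.

```python
import numpy as np
from scipy import stats


def fit_poisson(X, y, n_iter=25):
    """Poisson log-link GLM via Fisher scoring; returns (beta, model-based cov)."""
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean() + 0.5)  # assumes column 0 is the intercept
    for _ in range(n_iter):
        mu = np.exp(X @ beta)
        info = X.T @ (X * mu[:, None])           # X' W X with W = diag(mu)
        beta = beta + np.linalg.solve(info, X.T @ (y - mu))
    mu = np.exp(X @ beta)
    cov = np.linalg.inv(X.T @ (X * mu[:, None]))
    return beta, cov


def simulate_pvalues(n_units=500, n_obs=40, misspecified=False, seed=0):
    """One Wald p-value for X_random per unit (hypothetical setup)."""
    rng = np.random.default_rng(seed)
    x = np.linspace(-1.0, 1.0, n_obs)
    X = np.column_stack([np.ones(n_obs), x])     # same design for every unit
    pvals = np.empty(n_units)
    for i in range(n_units):
        # Each unit has its own (unknown to the analyst) beta.
        beta_true = np.array([rng.normal(1.5, 0.2), rng.normal(0.5, 0.2)])
        mu = np.exp(X @ beta_true)
        if misspecified:
            # Gamma-Poisson mixture: Var(Y) = mu + mu^2, so a Poisson fit
            # understates the standard errors.
            mu = mu * rng.gamma(shape=1.0, scale=1.0, size=n_obs)
        y = rng.poisson(mu).astype(float)
        # Balanced binary factor assigned at random, unrelated to y.
        x_random = rng.permutation(np.arange(n_obs) % 2).astype(float)
        Xa = np.column_stack([X, x_random])
        beta_hat, cov = fit_poisson(Xa, y)
        z = beta_hat[-1] / np.sqrt(cov[-1, -1])
        pvals[i] = 2.0 * stats.norm.sf(abs(z))
    return pvals


if __name__ == "__main__":
    for flag in (False, True):
        p = simulate_pvalues(misspecified=flag)
        ks = stats.kstest(p, "uniform")
        print(f"misspecified={flag}: share of p < 0.05 = {np.mean(p < 0.05):.3f}, "
              f"KS p-value = {ks.pvalue:.3g}")
```

In this sketch the overdispersed case piles the p-values up near zero because the model-based standard errors are too small. I would not assume that the check is equally sensitive to other departures, such as a wrong mean structure.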
Has anyone seen something like this before?
Regards,
Nik