Hi everyone,
This is one of those questions where it seems hard to find a clear cut answer. (:
I have an instrument covering several domain and I need to compute a total score for each domain, as well as a total score across domains. All domains include multiple questions whose answers are of the form:
1 2 3 4 Not Applicable
where 1, 2, 3 and 4 reflect increasing degrees of ability of the person being rated and Not Applicable indicates that the question was asked but the rater indicated it was not applicable. (There are multiple raters who answered the questions at two different occasions and the overall intent is to assess intra and inter-rater reliability.)
From my readings, it seems that coding Not Applicable as 0 (say) is a possibility, but it may (?) produce biased score estimates.
Some people suggest that the Not Applicable answers should be treated as missing completely at random, which would justify the use of multiple imputation. This doesn't seem quite right to me - if the question is not applicable, why should we assume it is and assign it a rating of 1, 2, 3 or 4?
Is there a principled way to deal with the Not Applicable answers in order to obtain reliable total scores?
Thanks very much,
Isabella
------------------------------
Isabella Ghement
Ghement Statistical Consulting Company Ltd.
------------------------------