The procedure you describe implicitly transforms each original ordinal variable from 1 to 5 into an interval variable from 1 to 5, and then proceeds to add or average these interval variables and appropriately interpret the result as an interval variable. You question amounts to asking whether the transformation is appropriate. A suggestion for transformation of ordinal to interval variables is given in Abelson, R. P., and Tukey, J. W., Efficient utilization of non-numerical information in quantitative analysis: General theory and the case of simple order, Annals of Mathematical Statistics, 34(4), 1347-1369, 1963.
------------------------------
James Schmeidler
Icahn School of Medicine at Mount Sinai
------------------------------
Original Message:
Sent: 03-31-2015 05:38
From: Eero Liski
Subject: Analyzing questionnaire data
Dear ASA members,
I am puzzled by the topic of analyzing questionnaire data. Suppose I have a large questionnaire, consisting of groups of questions. One of these groups might consist of questions trying to solve the participants agreement (or disagreement) on how well the state takes care of environmental issues. The answers are collected using an ordered 5-scale Likert scale (1 = total agreement,...,5 = total disagreement). My ultimate goal is to model one of these groups (a variable constructed from the individual answers) using a bunch of socio-demographic and some other explanatory variables.
My understanding is that for example in social sciences people many times calculate Cohen's Kappa to check the inter-agreement. The next step is to combine individual answers within a question-group (for example state and environmental issues). People seem to be fond of adding the individual values, or taking the average. This new variable (based on sum or average) is then treated as a continuous variable. Here is where I am most puzzled. That is, the original variables (individual answers) are ordinal variables, but after some transformation (summing up, average etc.) the new variable is treated as a continuous variable.
Have any of you worked on or researched this sort of problem? I am currently reading a book by Alan Agresti called 'Analysis of Ordinal Categorical Data', which deals with ordered rather than nominal responses. I am not yet sure whether this book gives me all the answers to this puzzling problem. I have only glanced through some social sciences -books, but those seemed to side-step this potential problem moving from ordinal scale to interval. They usually just present a method of combining the individual answers and simply move on.
I would very much appreciate any responses or suggestions. Especially, if someone knows a book or a paper where this issue is explicitly tackled, I would be more than happy.
Best wishes,
Eero Liski
------------------------------
Eero Liski
University of Tampere
Finland
------------------------------