There are no transformation done with Likert items, a priori, but the correlations implicitly standardize the items across cases. Ipsatizing the cases seems like overkill for students. Ipsatizing might be of interest in very advanced psychometric studies.
Conventionally, Likert items are treated as not severely discrepant from interval. Their sum is treated as not severely discrepant from interval level.
There are two major approaches to factor analysis. Both major approaches use the iter-item Pearson correlation matrix. The difference is largely in what is used on the principal diagonal. The kind used in attitude scale construction, principal axes (PAF), is interested in the common variance among the items and treats the variances that are unique to the items as noise so it uses estimated of relaibility (usually the squared multiple correlation of each item with the other items). The other approach, principal components (PCA) assumes that all of item variance is of interest. [PCA is more frequently used in contexts where the variables are measured at the ratio level, e.g., wit physical constructs.
Each item is considered a rough measurement of a construct. The sum of the items is considered a more valid and reliable measure of the construct. There is no substantive meaning to zero in attitude measures.
There are refinements and variation in approach that differ according to specifics of the situation.
If the attitude items are parts of pre-existing validated Likert scales, designed to measure specific constructs, a different approach would be used.
Assuming that the are two sets of items that are designed to explore what structure there might be within each set, and further to see whether the two groups differ in where they are located on the derived dimensions, I would stick with a very conventional attitude scale analysis.
Use principal axis factor analysis with varimax rotation to maximize differential validity.
Determine the number of factors to retain in each set . This is an area that has an art aspect. For advanced students parallel analysis would be part of the decision making. For a solution that retains a number of factors. Items that do not load cleanly are not used. Items that do not load at all are not used. Typically scales to represent a factor would be created only when there were at least 3 and preferably 4 or 5 items that go together and make sense as measures of some construct.
If the set of items is well designed some items will need to be "reflected".
Scale score are created as means of sets of items that go together. An item is only used in one scale. Each item has the same weight - one. Using weighted sums of items frequently failed when scales were used across subpops and studies.
If the instrument administration was not done carefully, there may be missing data. If a factor analysis done with listwise deletion of missing cases show that the items that did not load or did not load cleanly there may be fewer cases lost when the factor analysis is re-run without those items in the variable list.
Likert items are used in creating Likert scales because people are fairly consistent in using the response scale.
Some form of GLM with the scale scores as DVs can be used to look at the difference between the groups. The simplest would be t-tests.
More complex models of the error should also be tried. The write-up should look at whether the more complex models make a substantive difference in the conclusions.
HTH
-------------------------------------------
Arthur Kendall
Social Research Consultants
-------------------------------------------