Dear Dr. Landon,
I would recommend reading up on CANONICAL CORRELATION analysis. This is essentially a technique that simultaneously explores & tests the relations of a SET of variables (e.g., y1, y2, y3, etc.) versus a SET of variables (e.g., x1, x2, x3, etc.). For example, check the internet for SAS stats documentation re the CANCORR procedure ("Proc Cancorr").
Joseph J. Locascio, Ph.D.
------------------------------
Joseph J. Locascio, Ph.D.,
Assistant Professor of Neurology,
Harvard Medical School,
and Statistician,
Memory and Movement Disorders Units,
Massachusetts Alzheimer's Disease Research Center,
Neurology Dept.,
Massachusetts General Hospital (MGH),
Boston, Massachusetts 02114
Phone: (617) 724-7192
Email:
JLocascio@partners.org
Original Message:
Sent: 10-30-2015 09:27
From: Linda Landon
Subject: Principal Components Analysis
Everyone,
I'm about to make my first foray into principal component analysis since graduate school. I'm working on a multivariate regression problem that has multiple independent variables and multiple dependent variables. I'm wondering if it would be possible to perform multivariate regression by using BOTH the principal components representing dimensions in the set of independent variables as independent variables and the principal components representing dimensions in the set of dependent variables as dependent variables in the same regression model. However, I'm unsure if this application is statistically valid.
The common examples found in texts and in the peer-reviewed literature when PCA is applied prior to multivariate regression are
(1) multiple independent variables to find the principal components in the independent variables and then one or more components are used as independent variables in a regression against a single dependent variable or
(2) to multiple dependent variables to find the principal components in the dependent variables and then a single component is used as a dependent variables in a regression against a more or more independent variables.
However, I would like to know if it is possible to perform one multivariate regression analysis on two sets of principal components, one set of components representing the dimensions in the independent variables and the other set of components representing the dimensions in the dependent variables. In my rather limited literature search, I haven't found an example of this application in the peer-reviewed literature, which could indicate that it is not statistically valid. Further, other sources of information don't explicitly address this application.
Thanks in advance for your insight.
Linda
Linda A. Landon, PhD, ELS
President
Research Communiqué
Jefferson City, MO
Email: LandonPhD@ResearchCommunique.com
Phone: 573-797-4517
Central Standard Time ( CST ) = GMT-6 (November – February)
Central Daylight Time ( CDT ) = GMT-5 (March –October)