Statistical modeling relies on correctly partitioning variation into between-group and within-group sources. But what happens when principal component analysis (PCA)—often used for dimensionality reduction—does not align with the actual inferential goals?
In this talk, we explore a common analytical pitfall: using PCA before group comparisons or classification, where selected principal components may contain little information for distinguishing groups.