HI Tasneem:
I've conducted cluster analyses with both categorical and continuous data. The Kaufman and Rousseeuw book is a classic. The book by Everitt, Landau and Leese: Cluster Analysis is a very good introductory book that you may also wish to consider. Seber's Multivariate Observations also contains substantive chapters on cluster analysis with sufficient mathematical detail to help you understand the process.
I also find the suite of tools in R to be quite good, and the graphics produced are far better than most other software packages that I have used. Here is a list of some of the packages available for R:
library(cluster) # Kaufman and Rousseeuw Libraries
library(e1071) # Latent class models
library(gclus) # Auxillary tool for graphing and ordering hierarichal cluster solutions
library(mclust) # Model-based clustering algorithm libraries
library(mva) # Additional hierarchial clustering algorithms
library(multiv) # More hierarchial clustering algorithms
Cheers,
-------------------------------------------
John Cornell
Professor
University of Texas Health Science Center
-------------------------------------------
Original Message:
Sent: 10-03-2012 10:59
From: Tasneem Zaihra
Subject: Types of variables to use in cluster analysis
Hi Arthur and Daniel
Thanks for your response to my post, it's very informative. I really appreciate it.
I am trying to group patients with a chronic disease into clusters based on their demographics [such as gender, socio economic status, age etc]as well as other factors such as existence of other co morbidity conditions.
Thanks
Tasneem
-------------------------------------------
[Tasneem] [Zaihra]
[Post Doctoral Fellow]
[McGill University]
-------------------------------------------