Inquiries like this beg me ask: What's the question driving the study and the primary analysis?
For correlations, including ICCs, the critical question usually should be: What is the correlation? If so, then the common (frequentist) analysis calls for computing the estimate and constructing an appropriate confidence interval.
The sample-size problem is: Given a true (population) ICC, and k measurements/members per group, find the number, N, of independent groups required to obtain a desired confidence interval of full-width W. For example, for a study on identical twins, k = 2, N = number of pairs of twins. Maybe ICC.true = 0.70 and you want a tight CI, so W = 0.20.
This is dealt with in:
Bonett, D. G. (2002). Sample size requirements for estimating intraclass correlations with desired precision. Statistics in Medicine, 21(9):1331-1335.
which is implemented by the Nest() function in the ICC R package:
> library(ICC)
> (N <- Nest(est.type = "hypothetical", w=0.20, ICC = 0.70, k = 2,
+
x = NULL, y = NULL, data = NULL, alpha = 0.05))
2
0.7
101 Most people would probably deal with this by conducting a sample-size analysis to assess the statistical power for rejecting H0: ICC = 0 in favor of Ha: ICC > 0. But if you think critically, there is usually little or no scientific value in knowing that the ICC just differs from 0.00. (In the Bayesian denomination, the ICC would likely have a continuous and smooth prior, so ICC = 0 has positive density but zero probability.)
The same issue holds when the analysis involves a simple correlation: It is rarely important to know that X and Y have some non-zero correlation. We really should be asking: What is the X,Y correlation?
The same issue holds for comparing two independent means or proportions or odds. For example, we could be comparing two odds through the odds ratio, OR, from a 2 x 2 contingency table. Ho: OR = 1 is usually what I call a "straw null hypothesis" (one made of straw, so it is easily knocked over). Basing your sample size on the power to reject this just seems rather meaningless to me.
An excellent commentary along these lines is:
Connor, J. T. (2004). The value of a p-valueless paper. Am J Gastroenterol, 99(9):1638-40.
In most studies, one can formulate a given research question into a single "focal" parameter estimate and confidence interval that provide far more information than the p-value from testing a straw null hypothesis. If so, a corresponding sample-size analysis for study planning should be based on obtaining a CI that has adequate tightness and/or adequate
essential power to exclude the all values in some "null interval" for the parameter.
-------------------------------------------
Ralph O'Brien
Case Western Reserve University
-------------------------------------------
Original Message:
Sent: 08-15-2012 20:43
From: Robert Podolsky
Subject: intraclass correlation (ICC) testing?
The ICC can be viewed as estimating variance components, and you can use the variance on the estimated variance components to construct the appropriate test. You could use this approach to then determine power/sample size.
-------------------------------------------
Robert Podolsky
Georgia Health Sciences University
-------------------------------------------