This message has been cross posted to the following eGroups: Statistical Computing Section and Statistical Consulting Section .
-------------------------------------------
Dear all,
A friend of mine has predicted a binary event (low back pain) with two models (logistic regression and artificial neural network). She calculated the area under the ROC curves for comparing the predictive ability of these two models and the area under the ROC curves are approximately the same. But the STATA output shows a significant difference between these two curves. She has sent her article to a journal and because the areas are the same and the p-value is significant the reviewer thought the calculation is wrong.
. roccomp LBP Logit Neural
ROC -Asymptotic Normal--
Obs Area Std. Err. [95% Conf. Interval]
-------------------------------------------------------------------------
Logit 16233 0.7523 0.0045 0.74353 0.76108
Neural 16233 0.7536 0.0045 0.74489 0.76239
-------------------------------------------------------------------------
Ho: area(T_L) = area(T_N) chi2(1) = 8.54 Prob>chi2 = 0.0035
We think as the confidence interval for the areas overlap in a large portion, the reviewer thought the calculation is wrong. The covariance matrix is as follows:
var(area_Logit)= .00002004 var(area_Neural)= .00001992 cov(area_Logit, area_)=.00001988
She wants to answer the reviewer objection in this way:
"As we can see from the output, the two confidence intervals overlap in a large portion and it seems the null hypothesis, (the equality of two ROC curve) should not be rejected. However, because these two models applied to the same data set they are correlated (corr=.99). We should consider this correlation in the hypothesis testing. As a result, the statistical test for equality of these two areas is as follows:
X^2= (0.7523-0.7536)^2 / (0.00002004+0.00001992-2*0.00001988)=8.45
and we believe the significant result is not wrong."
She has searched about the way in STATA12 to compare two ROC curve but she couldn't find a desirable result.
If STATA uses the same way as she did, please let us know. If not, please introduce us a good reference to know more about the comparing algorithm.
Also please let us know if her way to calculate the chi-squared statistic for this kind of comparing is true or not?
Your contribution and discussion will be appreciated.
Bunch of thanks,
Amir
-------------------------------------------
Amir Kasaeian
PhD Student in Biostatistics
Tehran University of Medical Sciences (TUMS)
amir_kasaeian@yahoo.com akasaeian@razi.tums.ac.ir -------------------------------------------