Discussion: View Thread

  • 1.  Interaction Significant, Simple Main Effects Not Significant

    Posted 07-04-2012 16:22
    Dear All,

    I have a logistic regression model as below

    logit(y)=b0+b1*x1+b2*x2+b3*x3+b4*x4+b5*x3*x4

    now in this model x3*x4 and x4 are significant while x3 is not but if I  remove the interaction term then x3 is significant and x4 is not. Both x3 and x4 are scores based on two different questionnaire one of them is the mean of the overall score while other is the total.

    If I calculate the Spearman's rank correlation between them it is 0.41 (p-value<0.0001) while Pearson's correlation is 0.399(p value<0.0001), the sample size is 566.

    I believe , a significant interaction without  one of the main effect  being significant is the outcome of multicolinearity but I would really appreciate  any comments and suggestions from the group members  on the interpretations of this issue.


    Best Regards,
    Tasneem

    PS: There were other scores in the model whose correlations were higher than the above but their interaction didn't turn out to be significant.


  • 2.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-04-2012 16:32

    You can have a significant interaction without significant main effects and without collinearity; collinearity is a relationship among the independent variables; effects (whether main or interaction) are about relations between the dependent and independent variables

    In your case, one IV is significant and the other is not, and the interaction is. It is easy to create data that matches this. In R

    x1 <- rnorm(100)
    x2 <- rnorm(100)
    int <- x1*x2
    ylin <- 10*x1 + 5*x1*x2 + rnorm(100, .1, .02)
    ylog <- ylin > 0.8
    model <- glm(ylog~x1+x2+x1*x2)

    and there is no correlation between x1 and x2


    -------------------------------------------
    Peter Flom
    -------------------------------------------








  • 3.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-05-2012 11:47
    Peter is absolutely correct.

    Interaction terms may, however, be collinear with main effects. You can alleviate this by centering the variables around their mean before you create the interaction term. I think you simply have a significant interaction, but I thought I would pass along this information as well.

    -------------------------------------------
    JuliAnna Smith
    Methodology Consultant
    Center for Research on Families
    -------------------------------------------








  • 4.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-04-2012 16:44
    A significant interaction essentially indicates that the response for one variable depends on the level of another variable. Removing the interaction from the model does not make this relationship go away. Because of this relationship, significance of  the variables (ie main effects) should not be tested because such significance is overall statement; an overall statement can not be made because the interaction exists.
    David

    -------------------------------------------
    David Bristol
    Statistical Consulting Services
    -------------------------------------------








  • 5.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-05-2012 08:21
    here is my take:

    1. In the scope of studies we do, you can almost never prove that no clinically important interaction exists.  There is no statistical test that can prove this, outside of huge studies whose coinfidence regions are tightly wound about zero. So assume interactions exisr.

    2. Main effects can be hugely impoortant even when interactions exist.  For example in a 2 by 2 factorial study, main effects represent the average treatment A effect over the two settings of B.  For example, whether or not there is a treatment by gender interaction in a clinical trial, the main effect of treatment is an important parameter to estimate, and likely is the primary outcome variable in the study.

    Jon

    -------------------------------------------
    Jon Shuster
    University of Florida
    -------------------------------------------








  • 6.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-06-2012 07:56
    Both answers sound reasonable to me. In an "intention to treat" analysis, report the main effects. In a "per protocol" analysis, emphasize the interactions.

    Chuck

    -------------------------------------------
    Charles White
    Senior Biostatistician
    Walter Reed Army Institute of Research
    -------------------------------------------








  • 7.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-05-2012 08:24
    There may be counterexamples, but when I have encountered that type of pattern, it is typically related to collinearity as you suggest, but usually involving the crossproduct term even more strongly than the correlation of the two linear terms.  To confirm I would look at the correlation matrix (and if possible the eigenvalues) of the three variables x3, x4, and x3*x4.  It is sometimes possible to reduce the collinearity by expressing the crossproduct term as (x3-x3mean)*(x4-x4mean).

    But regardless, using the hierarchical principle of including all lower order terms if a higher order term is significant, I would keep all three terms x3, x4, and x3*x4 in the model.

    Best regards,

    -------------------------------------------
    Michael Morton
    -------------------------------------------








  • 8.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-05-2012 11:15
    Dear All,
    Thank you for your suggestions and comments, they are very helpful.
    I really appreciate your time.
    Best regards,
    Tasneem

    -------------------------------------------
    [Tasneem] [Zaihra]
    [Post Doctoral Fellow]
    [McGill University]
    -------------------------------------------








  • 9.  RE:Interaction Significant, Simple Main Effects Not Significant

    Posted 07-05-2012 17:46
    Dear Tasneem,

    We need to remind ourselves that we must always think about the data generating process when addressing questions like this. My favorite example is to imagine that we are sampling from a population of rectangles and we have data on A=area, L=length, and W=width.

    Clearly, the best model is A=L*W. No main effects and no intercept. Just an interaction term.

    I hope this helps you think through your problem.

    Best regards,

    -- Tom

    -------------------------------------------
    Thomas Sexton
    Professor and Associate Dean
    Stony Brook University
    -------------------------------------------