First off, since the interaction is not even close to significant (0.23), this is really a non-problem. No one should ever expect centers to be identical. Some will always have greater means and some lower means. This could be due to many factors: instrumentation, investigator biases, patient population differences (including demographic and baseline medical condition differences). While we tend to analyze the data using a fixed center model, we typically assume that investigators are random, with a population mean and standard deviation. So seeing a difference among centers is to be expected. As others have said, there is NO reason to a posteri pool centers based on their differences. I agree with the consensus that you shouldn't pool.
However, the important question is whether you see a 'qualitative' or 'quantitative' interaction. In your case, the interaction was n.s.
There is often reason to expect some centers might have a larger treatment effect than others. For example, some centers have severely ill patients. In that case, they could have quite large improvement, while the centers with less severe patients have more modest improvements. Small N and differential adherence to the protocol (greater error variance in some centers) also impact on the magnitude of individual center's treatment effect. By this logic, some center's treatment are small and positive and others are large and positive. As the treatment effect among centers is a random effect with a mean and standard deviation, with some centers it might even be numerically negative.
Gail and Simon (Biometrics 1985) wrote the key paper on analyzing for qualitative interactions. The gist of the approach is one looks at the treatment effects per center and examines the ones going in the 'wrong' direction. If the effects going in the wrong way is statistically significant, one would say the interaction was qualitative. Since the interactions are divided between the positive and negative effects, it is usually more difficult to have a qualitative interaction. For example, I just did a simulation for a dichotomous d.v. for a client and observed that overall, the incidence of any interaction (at the 0.05 alpha level) was 5%. However, the incidence of a qualitative interaction was around 0.7% (with an upper 95% CI of 0.9%). I used proc freq's table option of 'GailSimon'.
In sum, ignore center effects (one expects them), ignore a quantitative interaction (one expects them), but test and comment on the qualitative interaction. Since your overall interaction was n.s. tell your client that you are fully justified in reporting on the treatment effect and ignore the center main effect.
-------------------------------------------
Allen Fleishman, PhD, PStat®
President
Allen Fleishman Biostatistics Inc.
-------------------------------------------