Hello Tanaya,
I rather have questions than answers:
1. Why would you do a hypothesis testing?
a. I doubt that the sample is random.
b. It seems to me that it was not a planned situation.
c. What would be the target population for the intended inference?
2. I would rather calculate some descriptive statistics.
a. With 30 averages(I suppose that you are taking the average for each woman; otherwise your degrees of freedom will be inflated) per woman and 20 groups (5x4) there are, on the average, 1500 women. With those data you can have an idea about the corresponding population distribution.
b. Most probably there are other grouping factors, ethnicity, etc. So that you will have still some mixture in your 20 groups.
Regards,
Rene Valverde-Ventura
------------------------------
Rene Valverde-Ventura
------------------------------
Original Message:
Sent: 05-22-2020 13:43
From: Tanaya Kavathekar
Subject: Hypothesis testing
Hello,
I have a database of about 30,000 women who have tracked their weight throughout pregnancy. I have other information such as age and bmi group (BMI before pregnancy). The ages are group into 5 years bracket. So in total there 5 age groups and 4 bmi groups. The number of observations in the groups is uneven.
I want to test if there are statistical differences between the weight difference in the age/BMI group. My null hypothesis is the mean of the weight difference is the same across 5 age groups. I am currently using ANOVA in R. This test shows me a message "Estimated effects may be unbalanced"
Am I using the correct test? Is there any other test that works best with the unbalanced data?
Thank you,
Tanaya
------------------------------
Tanaya Kavathekar
George Washington University
------------------------------