Discussion: View Thread

Back to discussions

Expand all | Collapse all

Choosing Statistical test for an experiment in biology

1. Choosing Statistical test for an experiment in biology

Recommend

Tasneem Zaihra Rizvi

Posted 08-25-2015 10:39

Dear All,

A colleague of mine in the Dept of Biology is working on gastrointestinal motility. He is studying gastrointestinal motility using the zebrafish. He want to know if gastrointestinal transit changes after an experimental ‘condition (he knocks out genes). GI transit is the time it takes for food to pass through the gut. They merely feed zebrafish food with a marker and actually look at every fish at time intervals. They can see the marker in the GI tract, and therefore they know how much it has moved. They measure it at specific times. The data is scored- so if marker is in zone 1 we call it ‘1’, and if it is moved nearly all of the way through the GI tract they call it ‘4’. Once the GI tract is empty they call it ‘5’. So, for every fish they have numbers like 1,2,3,5. That would indicate the food moved progressively through the GUI tract and at the end of the experiment (24 hours) it is empty.

They want to know if the pattern, or transit changes with an experimental condition? The only paper using this technique used a Pearson’s Chi Square. They had between 500 and 1400 observations at every time point.

Here is a sample of their data:

	Control Uninjected				25 µM TB/SA Mix
Fish	0 hours	4 hours	8 hours	24 hours	0 hours	4 hours	8 hours	24 hours
1	2	3	3	5	1	3	3	5
2	1	5	5	5	1	3	4	5
3	1	5	5	5	2	4	4	4
4	1	3	3	5	1	2	3	x
5	3	3	5	5	2	5	5	5
6	2	3	5	5	2	4	5	5
7	1	5	5	5	3	3	4	4
8	2	5	5	5	3	2	2	4
9	1	4	4	5	2	2	3	3
10	1	3	4	4	1	3	4	5
11	1	4	4	5	1	3	4	4
12	2	4	5	5	1	5	5	5
13	2	2	4	x	1	3	5	5
14	1	4	5	5
15	1	3	4	5
Average Zone	1.5	3.7	4.4	4.9	1.6	3.2	3.9	4.5

An X means the fish died.

I do feel we can do much better than a simple chi-squared tests or a perhaps a Wilcoxon test. Any suggestions/ comments would be really appreciated.

Best Regards,

Tasneem

------------------------------
[Tasneem] [Zaihra]
[Post Doctoral Fellow]
[McGill University]
------------------------------

2. RE: Choosing Statistical test for an experiment in biology

Recommend
David Bristol
Posted 08-25-2015 11:04
Tasneem,

You can do "better".

Off the top of my head:

(1) Time to '5'

(2) Markov chain with 5 states (compare vectors of transition probabilities)

(3) Isotonic regression for ordinal data.

Deaths are an issue with (2) and (3)

David

------------------------------
David Bristol
Statistical Consulting Services
------------------------------
3. RE: Choosing Statistical test for an experiment in biology

Recommend
Eric Siegel
Posted 08-26-2015 21:35
Although at first glance it looks like nonparametric longitudinal data, I'm thinking No, Not Really. It actually looks like time-to-event data. For example if 5 means Expelled, then one could ask, how long did it take to observe a 5 in each zebrafish, and how many zebrafish are right-censored for achieving a 5 after 24 hours. I appreciate there appears to be a whole lot of interval-censoring going on in addition to the right-censoring, but I might still start with Kaplan-Meier curves to create the initial picture, and I might still use a log-rank test to compare the curves.

I imagine one could adopt the same analysis approach with respect to the time it takes each fish to to achieve a 4, a 3, or a 2. But if one pursued the times to achieve those events, then I would need to ask: does the food ever move backwards in any of the fish?

------------------------------
Eric Siegel
Biostatistician
Univ of Arkansas for Medical Sciences of Biostatistics
------------------------------
4. RE: Choosing Statistical test for an experiment in biology

Recommend
Isabella Ghement
Posted 08-26-2015 23:57
Why was the experimental group injected with 25 µM TB/SA Mix?

The injection could have been intended to either accelerate or slow down the passing of the food through the gut.

The comparison of "patterns" across groups should ultimately reflect the purpose of the injection.

If the injection was intended to accelerate the passing of food through the gut, one could compare the two groups with respect to the outcome "The first time the food reached zone 5 of the GI tract". This outcome will be censored for fish who died or who did not reach zone 5 of the tract.

------------------------------
Isabella Ghement
Ghement Statistical Consulting Company Ltd.
------------------------------

Original Message
5. RE: Choosing Statistical test for an experiment in biology

Recommend
Deborah Dawson
Posted 08-27-2015 10:37
I agree that a time-to-event approach would be helpful, and would also support looking at time to achievement of movement to each zone. I believe you would need to use interval censoring in this case, since you do not have exact time of food arriving in a particular zone, but identify status at specific time intervals.

You might take a look at the "interval" package in R (see Fay and Shaw reference below).

I believe SAS can also be used, based on a paper by So, Johnston and Kim in SAS Global Forum 2010; there is also a procedure called ICLIFETEST, although I have not used it myself.

Some references:

Chen D-G, Sun J, Peace KE (2013). Interval-Censored Time-to-Event Data: Methods and Applications. Chapman and Hall.

Fay, MP (1996). "Rank invariant tests for interval censored data under the grouped continuous

model". Biometrics, 52: 811-822.

Fay, MP (1999). "Comparing Several Score Tests for Interval Censored Data." Statistics in Medicine,

18: 273-285 (Correction: 1999, 18: 2681).

Fay, MP and Shaw, PA (2010). Exact and AsymptoticWeighted Logrank Tests for Interval Censored

Data: The interval R package. Journal of Statistical Software. http://www.jstatsoft.org/v36/

i02/. 36 (2):1-34.

Fay, MP and Shih JH. (2012). Weighted Logrank Tests for Interval Censored Data when Assessment

Times Depend on Treatment. Statistics in Medicine 31, 3760-3772.

Fay, MP and Hunsberger, SA. (2012). Practical Issues on Using Weighted Logrank Tests with

Interval Censored Events in Clinical Trials. Chapter 13 in Interval-Censored Time-to-Event Data:

Methods and Applications, Chen, D-G, Sun, J, and Peace, KE (editors) Chapman and Hall/CRC.

Finkelstein, DM (1986). "A proportional hazards model for interval censored failure time data"

Biometrics, 42: 845-854.

Huang, J, Lee, C, Yu, Q (2008). "A generalized log-rank test for interval-censored failure time data

via multiple imputation" Statistics in Medicine, 27: 3217-3226.

Sun, J (1996). "A non-parametric test for interval censored failure time data with applications to

AIDS studies". Statistics in Medicine, 15: 1387-1395.

------------------------------
Deborah Dawson
Director of Biostatistics
University of Iowa
------------------------------

Original Message
6. RE: Choosing Statistical test for an experiment in biology

Recommend
Jason Machan
Posted 08-27-2015 10:55
1) TE using lifetable method would do trick...but will likely experience resistance to distance between conventional approach and descriptions. Also more challenging for higher-order effects or linear contrasts...limiting.

2) Alternative...probably what I'd do: Binomial...0-5 successes out of 5 trials...watch your residuals, though...obviously not the exact right distribution to arise out of phenomenon, but proportions with 95%CI can easily be back-translated to original scale by multiplying by 5...and get to use standard linear modeling (in SAS, proc glimmix would handle easily).

------------------------------

Jason T. Machan
Director, Lifespan Biostatistics Core,
Lifespan Hospital System
Research Scientist, Biostatistics, Research
Rhode Island Hospital
Assistant Professor, Departments of Orthopaedics and Surgery
The Warren Alpert Medical School, Brown University
Director Biostatistics Externship, Adjunct Assistant Professor, Department of Psychology
University of Rhode Island
------------------------------
7. RE: Choosing Statistical test for an experiment in biology

Recommend
Jason Machan
Posted 08-27-2015 12:11
Additionally, someone sent me a note after I'd replied. Part of that note made me realize I didn't address the repeated measures nature of the data.

I suspect the function will be close enough to linear (in logit-scale) to use a growth model for binomial data (g-side, random intercept and slope). If not, could treat each assessment time as categorical and use correlated error model (r-side, correlated error across assessment times with 1^st order autoregressive or otherwise). …throwing classical sandwich estimation at it for good measure.

Will need to reformat data to be long, rather than wide...one observation per fish per assessment time (make a variable). Also add a denominator for the binomial model (e.g. max_score=5; in the datastep somewhere). Then,

Growth Model

proc glimmix plots=(ResidualPanel PearsonPanel) empirical;

class Fish Group;

model score/max_score = Group | Assessment_Time

/dist=binomial;

estimate "slope in group 1" Assessment_Time 1 Group*Assessment_Time 1 0,

"slope in group 2" Assessment_Time 1 Group*Assessment_Time 1 0

/cl adjdfe=row stepdown ilink;

random intercept Assessment_time/subject=Fish type=csh;

run;

...tweak variance-covariance matrix if necessary, though classic sandwich estimation (empirical) will "fix" a lot.

...graph to get a gut-check.

Correlated Residual Error model

proc glimmix plots=(ResidualPanel PearsonPanel) empirical;

class Fish Group Assessment_time;

model score/max_score = Group | Assessment_Time

/dist=binomial;

lsmeans Group*Assessment_Time

/cl adjdfe=row stepdown ilink;

random Assessment_time/subject=Fish type=ar(1) residual;

run;

...tweak variance-covariance matrix if necessary, though classic sandwich estimation (empirical) will "fix" a lot.

...graph to get a gut-check.

------------------------------
Jason T. Machan
Director, Lifespan Biostatistics Core,
Lifespan Hospital System
Research Scientist, Biostatistics, Research
Rhode Island Hospital
Assistant Professor, Departments of Orthopaedics and Surgery
The Warren Alpert Medical School, Brown University
Director Biostatistics Externship, Adjunct Assistant Professor, Department of Psychology
University of Rhode Island
------------------------------

Original Message
8. RE: Choosing Statistical test for an experiment in biology

Recommend
Jason Machan
Posted 08-27-2015 12:13
"slope in group 2" Assessment_Time 1 Group*Assessment_Time 1 0

should be

"slope in group 2" Assessment_Time 1 Group*Assessment_Time 0 1

Sorry!

------------------------------
Jason T. Machan
Director, Lifespan Biostatistics Core,
Lifespan Hospital System
Research Scientist, Biostatistics, Research
Rhode Island Hospital
Assistant Professor, Departments of Orthopaedics and Surgery
The Warren Alpert Medical School, Brown University
Director Biostatistics Externship, Adjunct Assistant Professor, Department of Psychology
University of Rhode Island
------------------------------

Original Message

Discussion: View Thread

Choosing Statistical test for an experiment in biology

Tasneem Zaihra Rizvi08-25-2015 10:39

David Bristol08-25-2015 11:04

Eric Siegel08-26-2015 21:35

Isabella Ghement08-26-2015 23:57

Deborah Dawson08-27-2015 10:37

Jason Machan08-27-2015 10:55

Jason Machan08-27-2015 12:11

Jason Machan08-27-2015 12:13

1. Choosing Statistical test for an experiment in biology

2. RE: Choosing Statistical test for an experiment in biology

3. RE: Choosing Statistical test for an experiment in biology

4. RE: Choosing Statistical test for an experiment in biology

5. RE: Choosing Statistical test for an experiment in biology

6. RE: Choosing Statistical test for an experiment in biology

7. RE: Choosing Statistical test for an experiment in biology

8. RE: Choosing Statistical test for an experiment in biology