Thank you for your excellent questions, Jonathan! Please see my answers below.
Question 1, 2 and 3The subjects in this study are patients who received a stem cell transplant (SCT).
The event of interest (Event_1) is
clearance of FISH abnormalities post-SCT. Patients were tested for (genetic) abnormalities once sometime prior to the transplant and once sometime after the transplant using the FISH test. If the abnormalities present pre-SCT were found to have cleared post-SCT, the patients were deemed to have experienced Event_1. The date when the clearance occurred post-SCT (if it did occur) is known for all patients who received the FISH test pre- and post-SCT. However, some patients received the FISH test pre-SCT but not post-SCT, which means that, for those patients, we do not know whether or not they would have experienced clearance of abnormalities. The time when they would have experienced clearance is also not known.
The competing event (Event_2) is
Death. We do know, for all patients of interest, whether or not they died by the end of the study. For those who died, we know their exact death date in relation to the date of the transplant. Patients were monitored for several years (up to about 14-15 years for some patients).
So there is no periodic assessment of when the clearance might have occurred - this is assessed just once during the study period, though how the clinicians decide
when a patient should be assessed and
why I do not know. Whether they (i) deemed the patient didn't warrant follow-up testing for clearance of abnormalities based on clinical indications, (ii) forgot to administer the test post-SCT even though they did administer it pre-SCT or (iii) administered the test post-SCT but didn't record the result - there really is no such information available.
Even for patients who were tested for clearance post-SCT, I guess it is conceivable (?) that, if a patient would have been tested at a later date than the actual date when they were initially tested without achieving clearance, they might have achieved clearance at the subsequent test date despite not achieving clearance at the earlier date.
Question 4: Was there a particular individual or site who did this (or other systematic cause) or was the missing data distributed more or less randomly?It is my understanding that these patients all come from a single cancer agency. I can ask whether all of these patients were seen by the same doctor. Not sure whether any other systematic cause would have been responsible for these patients not being tested for clearance post-SCT.
Question 5: Is there an event hierarchy here? Can one event precede the other but not vice versa?Patients can experience post-SCT clearance first and then die. Or they can die before having a chance to experience clearance, in which case death precludes clearance. So Event_1 (Clearance) can precede Event_2 (Death), and if Event_2 occurs, Event_1 can no longer occur.
Question 6: If you have an imputation method you can really believe in, why wouldn't you also use it to impute the event date for patients whose events hadn't occurred by the end of the study? Why just use it for the patients with no event data at all?
I don't really have an imputation method I can really believe in, hence my question on this forum. (:
For patients who were tested for clearance post-SCT but were found not to have cleared their abnormalities, I guess I am making the implicit assumption that they would have maintained the same non-clearance status until the end of the study or death, whichever comes first. Maybe this assumption is justified by the fact that they would have been tested for clearance just once or maybe it is not.
For patients who were NOT tested for clearance post-SCT, my predicament is that I do not know which of them would have rendered a positive test for clearance of abnormalities and which of them would have rendered a negative test post-SCT. I can assume the two extremes: all would have rendered a positive test and all would have rendered a negative test. This way, I would fill in the values of the Indicator_1 variable with all 1's or all 0's and only worry about imputing the time to event. But I still don't know what the best way to do that is, given that I also have information about Indicator_2 and Time_2, as well as some covariates. What about the situation where some patients render a positive test and the other a negative test?
------------------------------
Isabella Ghement
Ghement Statistical Consulting Company Ltd.
------------------------------
Original Message:
Sent: 01-16-2019 09:28
From: Jonathan Siegel
Subject: Missing value imputation in a competing risks setting
I have a couple of questions about this.
1. How do we know when the events occurred? Can we tell an exact date by looking once at the end of the study, or do we have to assess periodically as the study is going on?
2. If periodically, what was the duration of the study and how frequently were the assessments made?
3. How did the missing patients come to get assessed for one event but not the other (possibly at every assessment)?
4. Was there a particular individual or site who did this (or other systematic cause) or was the missing data distributed more or less randomly?
5. Is there an event hierarchy here? Can one event precede the other but not vice versa?
6. If you have an imputation method you can really believe in, why wouldn't you also use it to impute the event date for patients whose events hadn't occurred by the end of the study? Why just use it for the patients with no event data at all?
Note: Censoring at day 1 (or, if a subsequent event in an event hierarchy, the day after the preceding event or end of study if the preceding event never occurs) would be the traditional method if non-informativity of missingness assumptions can be justified.
------------------------------
Jonathan Siegel
Deputy Director Clinical Statistics
Original Message:
Sent: 01-14-2019 20:35
From: Isabella Ghement
Subject: Missing value imputation in a competing risks setting
Hi everyone,
Currently, I am working on a study where I need to compute the cumulative incidence of an event of interest (Event_1) in the presence of a competing event (Event_2).
For each of these events, I have an indicator variable and a time to event variable. In particular:
Indicator_1 = 1 if the subject experienced Event 1 by the end of the study;
= 0 if the subject did NOT experience Event 1 by the end of the study.
Indicator_2 = 1 if the subject experienced Event 1 by the end of the study;
= 0 if the subject did NOT experience Event 1 by the end of the study.
Furthermore,
Time_1 = time to Event 1 (if the subject experienced this event) or to the
end of study (if the subject did NOT experience this event);
Time_2 = time to Event 2 (if the subject experienced this event) or to the
end of study (if the subject did NOT experience this event).
Now, the problem is that for a number of subjects (about 10% of all subjects), the test which was supposed to determine whether or not they experienced Event_1 was not administered (for whatever reasons, which are not known). So these subjects have missing values for both Indicator_1 and Time_1. Thus, for these subjects, we don't know whether or not they would have experienced Event_1 before the end of the study. We also don't know the corresponding time to event (possibly censored).
My questions are: Is it possible, just from the information provided, to impute the missing values for the indicator variable Indicator_1 and the corresponding time variable Time_1? If it is, how can this be done (in R)? If it isn't, what other options should I consider for analyzing these data? (I do have access to some covariates such as age, sex, etc.)
Any comments, references or idea would be greatly appreciated.
Thank you in advance for your help!
Isabella
------------------------------
Isabella R. Ghement, Ph.D.
Ghement Statistical Consulting Company Ltd.
E-mail: <maskemail>Isabella@...</maskemail>
Phone: 604-767-1250
------------------------------