In general I would not. I think the most important reason is that fhe existence and extent of missing data represent important descriptive facts about the phenomena being observed and/or your observation methods, and hence are relevant to any effective description. If a large portion (or only a small portion) of the data are missing, this would be important to know and relevant to the data's interpretability.
I agree that, because imputation methods require assumptions, they go beyond simply describing the data observed. In addition, they may result in underestimating, perhaps substantially, variances and CIs, and hence in overestimating precision.
There me are situations where it is especially important to avoid imputation in ones descriptions. In situations involving heavy tails, For example, outliers such as long-term survivors, uncommon toxicity events, very high-income individuals, stock-market crashes, etc., are often very important to an effective description of the phenomenon being evaluated. Imputation methods tend to result in under-reporting or underestimating the impact of such phenomena.
------------------------------
Jonathan Siegel
Associate Director Clinical Statistics
Original Message:
Sent: 07-26-2016 01:38
From: Md Abdullah Mamun
Subject: Should we impute missing data while presenting descriptive stat?
Most of the proposed methods for missing data imputation are guided to regression analysis. Should (or Can) we impute missing data while the objective is merely to present some descriptive statistics (mean, SD, mode) in the preliminary tables of a manuscript? If yes, which method is appropriate? I am familiar with mean imputation, stochastic imputation, and multiple imputation. Given that the missing data met the MCAR or MAR criteria.
Thank you in advance,
Mamun
------------------------------
Md Abdullah Mamun
PhD Student
UNTHSC
------------------------------