As regards the relationship between control charts and testing,
this is a copy of a letter I wrote to the editor of JQT. It was published in 2001.
The article Woodall (2000) and its discussion examine the relationship between control
charts and hypothesis tests. I would like to add, what I consider to be, key references to
this discussion. Shewhart (1986, p.42) states, “For a prediction to have an operationally
definite meaning, it is necessary that there be given or implied a perfectly definite way
of determining whether it is true or false. Hence it is necessary that there be implied an
operationally definite meaning of the statistical state of control in terms of characteristics
of the sequence [of data]. There are two senses in which we may have such a meaning.
One is the theoretical sense in which we include all possible criteria that the mathematical
statistician may impose upon the infinite sequence [of data] as a characterization of what
he means by a mathematical state of control. The other is the practical sense in which one
chooses a limited group of criteria to be appled in some specific way to a finite portion of the
sequence ...” Earlier, on page 40, Shewhart notes that Neyman-Pearson theory “involves
the assumption that the observed data constitute a random sample, and we have already
considered some of the difficulties involved in trying to give this term an empirical and
operationally verifiable meaning. In fact, we may think of the whole operation of statistical
control as an attempt to give such meaning to the term random.”
Given Shewhart’s and Deming’s emphasis on operational definitions, I believe their
argument is that control charts should be viewed as providing an operational definition of
what it means for a process to be under control. Indeed, it is a simple exercise to see that
a basic means chart is sensitive, not only to shifts in the process mean, but also to shifts
in its variance and to positive correlation within the rational subgroups, cf. Christensen
and Bedrick (1997). Thus, means charts provide quite a good beginning for creating an
operational definition of what it might mean to have independent identially distributed
observations. Philosophically, this is a far cry from assuming randomness and testing an
hypothesis.
References
Christensen, Ronald and Bedrick, Edward J. (1997). “Testing the independence assumption
in linear models,” Journal of the American Statistical Association, 92, 1006-1016.
Shewhart, Walter A. (1986). Statistical Method from the Viewpoint of Quality Control.
Dover, New York.
Woodall, William H. (2000). “Controversies and Contraditions in Statistical Process Control,”
with discussion, Journal of Quality Technology, 32, 341-378.
------------------------------
Ronald Christensen
Univ of New Mexico
------------------------------
Original Message:
Sent: 01-17-2017 15:45
From: William Harris
Subject: CLT rule of thumb n>=30: not quite true!
John,
Thanks for posting this. While I don't think Shewhart wrote about tires, I recall something in his Economic Control of Quality of Manufactured Product that sounded a lot like what you wrote about n = 4. From something else I read, I gather that Shewhart's work wasn't derived from Neyman-Pearson thinking but was something he developed himself, almost like a third alternative to Fisher and Neyman-Pearson.
AFAICT, Shewhart-style control charts are still used today, even as statistics has moved forward in other areas. Are they still appropriate, or are there better statistical approaches to help management deal with process variation appropriately? Much of the SPC literature I read seems separate from NHST thinking and from current Bayesian inference and decision making, although I do recall Stu Hunter's Bayesian Approaches to Teaching Engineering Statistics.
Bill
------------------------------
Bill Harris
Data & Analytics Consultant
Snohomish County PUD
------------------------------
Original Message:
Sent: 01-17-2017 14:19
From: John Stickler
Subject: CLT rule of thumb n>=30: not quite true!
As an industrial statistician, some of the data collected about airplane tire parameters became nearly normal by averaging 4 test values. One issue with theoretical statisticians, is that they are great on showing theoretical areas of non-normality, but their ideas my not be practical in the real world of collecting data. In industry we are trying to solve issues by minimizing the cost of collecting data. Of course, we must confirm what we have found.
John Stickler