ASA Connect

 View Only
Expand all | Collapse all

Accessible Reading on Statistics vs Mathematics

  • 1.  Accessible Reading on Statistics vs Mathematics

    Posted 08-16-2015 08:21

    A colleague of mine asked me if I had any readings about the differences (and maybe similarities) between Mathematics and Statistics for a Math Ed reading course.

    I did not have anything off hand but figured their must be something.   Any suggestions?



    ------------------------------
    Samuel Cook
    Assistant Professor
    Wheelock College
    ------------------------------



  • 2.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-17-2015 07:48

    How is a raven like a writing desk?

    Statistics is a science (the science of uncertainty) like physics, that uses mathematics.

    It is not mathematics.

    I think it is best to have students read maerial that shows them what statistics is and let them draw their own comparisons.

    I think that Paul Velleman and  Dick Deveaux may have done something in comparisons, but their books on statistics are clear. Also some of my recent books mahy be helpful:

    Truth or Truthiness: Distinguishing Fact from Fiction by Learning to Think like a Data Scientist. New York: Cambridge University Press, 2016.

    Medical Illuminations: Using Evidence, Visualization & Statistical thinking to Improve Healthcare.  London: Oxford University Press, 2014 (finalist Royal Society Winton Book Prize).

    A Statistical Guide for the Ethically Perplexed (with L. Hubert). New York: Chapman and Hall, 2013.

    Uneducated Guesses: Using Evidence to Uncover Misguided Education Policies. Princeton, NJ: Princeton University Press, 2011 (finalist for the Grawemeyer/World Order Award)

    Picturing the Uncertain World: How to Understand, Communicate and Control Uncertainty through Graphical Display. Princeton, NJ: Princeton University Press, 2009.



    ------------------------------
    Howard Wainer
    Distinguished Research Scientist
    National Board of Medical Examiners
    ------------------------------




  • 3.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-17-2015 10:08

    There is a recent book by Jordan. Ellenberg which covers many of these tropics. He isa. Mathematician whose parents are both staatisticcians

    ------------------------------
    Peter Lachenbruch
    ------------------------------




  • 4.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-17-2015 12:00

    I think the essential difference is that mathematics is about whether the conclusions follow from the assumptions. By contrast, statistics is about whether the assumptions have anything to do with the real world.

                            Jay Kadane

    ------------------------------
    Joseph Kadane
    Leonard J. Savage Professor of Statistics, Emeritus
    Carnegie-Mellon University
    ------------------------------




  • 5.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-18-2015 11:28

    Well put! May I quote you, with attribution?

    ------------------------------
    Douglas Langbehn
    Professor
    Psychiatry and Biostatistics (secondary)
    University of Iowa
    ------------------------------




  • 6.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 04:24

    To echo Jay, I think the essential difference is that
    statistics is about whether the assumptions have anything to do
    with the real world, and mathematics is about whether statistics
    can ever find out.


          Judea Pearl
          Home: http://www.cs.ucla.edu/~judea/
          Results: http://bayes.cs.ucla.edu/csl_papers.html
          Blog: http://www.mii.ucla.edu/causality/.


    ------------------------------
    Judea Pearl
    University of California, Los Angeles
    ------------------------------




  • 7.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 04:26

    To echo Jay, I think the essential difference is that
    statistics is about whether the assumptions have anything to do
    with the real world, and mathematics is about whether statistics
    can ever find out.


          Judea Pearl
          Home: http://www.cs.ucla.edu/~judea/
          Results: http://bayes.cs.ucla.edu/csl_papers.html
          Blog: http://www.mii.ucla.edu/causality/.


    ------------------------------
    Judea Pearl
    University of California, Los Angeles
    ------------------------------




  • 8.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-18-2015 10:01

    George Box had this to say about the difference between statistics and mathematics (Technometrics August 1990). It is one of my favorite quotes.

    “Statistics is, or should be, about scientific investigation and how to do it better, but many statisticians believe it is a branch of mathematics…. Now I agree that the physicist, the chemist, the engineer, and the statistician can never know too much mathematics, but their objectives should be better physics, better chemistry, better engineering, and in the case of statistics, better scientific investigation. Whether in any given study this implies more or less mathematics is incidental.”


    ------------------------------
    James Higgins
    Kansas State Univ
    ------------------------------




  • 9.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-19-2015 10:26

    Excellent remarks, with characteristic clarity, by George Box.  I cannot agree more.  Clear thinking and proper understanding of the scientific problem do not require mathematical virtuosity.  Exhibiting mathematical wizardry, at the expense of providing an effective solution, is never to be encouraged. Only as much mathematical machinery as is essential to solve the scientific problem at hand should be used. Of course, this is never an issue for lesser mortals, like myself, with limited mathematical skills! 
    ------------------------------
    Ravi Varadhan
    Johns Hopkins University
    ------------------------------




  • 10.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-19-2015 16:22

    All they need to know is...

    Just because it is mathematically possible does not mean it is statistically appropriate.


    ------------------------------
    Jacques Detiege
    Evaluation & Assessment Specialist
    Southern University System
    ------------------------------




  • 11.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-20-2015 04:05

    Excellent remarks, in my opinion. Thanks to all the participants.

    ------------------------------
    Murat ÖZEL
    Business Analytics Senior Manager
    TEB (BNPParibas JV)
    ------------------------------




  • 12.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-20-2015 08:50

    A mathematician believes that 2-1 ≠ 0, a statistician is uncertain.  
    ------------------------------
    Eugene Komaroff
    Keisser University
    ------------------------------




  • 13.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 13:34

    Two mathematicians and a statistician went hunting.  Each mathematician took a shot: one was off 1 yard to the left and the other was off 1 yard to the right.  The statistician exclaims, "We hit it!"

    ------------------------------
    Charles Coleman
    ------------------------------




  • 14.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-24-2015 01:14

    A statistician is a mathematician broken down by age and sex.

    >>Kathy

    ------------------------------
    Katherine Godfrey
    ------------------------------




  • 15.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-20-2015 09:29

    "All over the world, human beings create an immense and ever–increasing volume of data, with new kinds of data regularly emerging from science and industry. A new understanding of the value of these data to society has emerged, and with it, a new and leading role for Statistics.

    In order to produce sensible theories and draw accurate conclusions from data, cutting edge statistical methods are needed. These methods use advanced mathematical ideas combined with modern computational techniques, which require expert knowledge and experience to apply."

    Well-said by Oxford University:

    http://www.ox.ac.uk/admissions/undergraduate/courses-listing/mathematics-and-statistics



  • 16.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-20-2015 20:49

    I am loving this discussion and so glad that your colleague asked for sources to read!  Ultimately, and i think this was mentioned by others,students really need to experience statistics as an investigative science and really have to think hard about how data can be used to support claims. If they engage in statistics in this was=y (rather than only being asked to compute and graph things and judged whether they understand how to use a formula, or why the formula works mathematically).

    I teach a course on "teaching and Learning Statistical thinking". ANd we start the semester off with several readings, two of which are in the NCTM 2006 yearbook.

    Rossman, A., Chance, B., & Medina, E. (2006). Some important comparisons between statistics and mathematics, and why teachers should care. In G. F. Burrill & P. C. Elliott (Eds.), Thinking and reasoning with data and chance: Sixty-eighth NCTM yearbook (pp. 139-150). Reston, VA: National Council of Teachers of Mathematics.

    Scheaffer, R. L. (2006). Statistics and mathematics: On making a happy marriage. In G. F. Burrill & P. C. Elliott (Eds.), Thinking and reasoning with data and chance: Sixty-eighth NCTM yearbook (pp. 139- 150). Reston, VA: National Council of Teachers of Mathematics.

    And a REALLY important seminal piece to read is this one:

    Wild, C. J., & Pfannkuch, M. (1999). Statistical thinking in empirical enquiry. International Statistical Review, 67(3), 223-248.

    At NCSU, I now also teach a MOOC designed for middle school teachers through high school/intro stats instructors at the college level. In that course I need brief readings that are FREE! Thus, myself and a colleagues synthesized some of the good work written by others into a 1-2 pager! You can find it here

    http://info.mooc-ed.org.s3.amazonaws.com/tsdi1/Unit%202/Essentials/Statvsmath.pdf

    And this reading is accompanied by another brief reading on statistical habits of mind. you can see that here

    http://info.mooc-ed.org.s3.amazonaws.com/tsdi1/Unit%202/Essentials/Habitsofmind.pdf

    And by the way, the MOOC (free) will run again this fall (starts Sept 28th)

    http://go.ncsu.edu/tsdi 

    Many Smiles

    Hollylynne Lee @ NC State



    ------------------------------
    Hollylynne Lee
    ------------------------------




  • 17.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 10:13

    Thank you for your articles.  I teach AP and Dual Enrollment Statistics at a high school. For many of my students, this is their first exposure to Statistics - I plan on sharing these article with them during the first week of school to help them understand that while they are getting a math credit for taking the course - the thinking is not the same as a regular math class.  



    ------------------------------
    Jennifer Palmer
    Teacher
    Madison High School
    ------------------------------




  • 18.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-24-2015 11:38

    The ASA has a document discussing qualifications for teaching statistics at the undergraduate level:  http://www.amstat.org/education/pdfs/TeachingIntroStats-Qualifications.pdf. At the end of the document there are several very readable papers on the difference between mathematics and statistics.

     

    Within this list, I think the best one for AP students would be DeVeaux, R., and Velleman, P., (2008). Math is Music; Statistics is Literature. Amstat News, Sept 2008, No 375, p. 54. A quick google search will find pdf versions online.

     

    -Shonda

     






  • 19.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-24-2015 12:45

    The fact that we can even ask such a question is really quite remarkable. It is simply assumed that math and statistics strongly overlap. The nexus between them, of course is probability theory. The predominance of probability in (modern) statistics is a recent phenomenon historically. The story of how this came about is fascinating and has important implications for both the power and the limitations of statistical methodology. I rather shamelessly recommend my recent book Willful Ignorance: The Mismeasure of Uncertainty, in which I have tried to tell this story. I believe the issues are critically important, whether or not you agree with my interpretations of the past and prognoses for the future.

    ------------------------------
    Herbert Weisberg
    President
    Causalytics, LLC
    ------------------------------




  • 20.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 00:34

    So, translating a little bit, you would like to to receive an explication of the relationship between mathematics and statistics?

    Interestingly, I delivered a paper (really more of a think-piece) at JSM14 in Boston which spoke to those issues, but in a somewhat tangential way.  The main focus of the presentation was observational causal inference, but, to get to that destination, I had to conduct a tour of my conception of how mathematics and statistics are related and will in the future be related.  Surprisingly, it seems that certain aspects of the relationship will change over time.  The paper appears in the JSM14 proceedings under the title, "Toward a General Theory of Observational Causal Inference" and can also be found at http://tinyurl.com/tgtoci

    As a quick and very limited cut at the relationship between mathematics and statistics, I will first present the following one sentence Gestalt, pointing in the direction of my opinion:

    "Frequentist and Bayesian statistics can be considered to be axiomatic, applied mathematical discourses,ultimately becoming inquiring paradigms."

    If this opaque thought tells you everything you need to know, then you can move on to the next item on your to do list.  If not, then I would suggest you read further.

    For almost everyone this sentence may raise more questions than it answers.  For that reason I have below presented an extract from the think-piece which rummages around in the area of interest, while traversing to the new observational statistical paradigm I call causal statistics.  Oddly enough, this extraction may convey a fuller understanding of the connection between mathematics and statistics, than if I had started out with the goal of simply answering your question, because of the contextual relationships presented.  Additionally, the extractive passage also points to how the sought relationship may morph in the future:

    In 1953, Einstein, in a letter to a colleague, wrote (paraphrased here for clarity) that

    the phenomenal  success of the physical (i.e., experimental)  sciences is based on two great achievements: 1) mathematical axiomization,  as in the derivation of Euclidean geometry, and 2) causal inferences, arrived at through experimentation and leading to causal theories.

    In this paper, we shall deal with both of these great achievements in the context of both new and old statistical paradigms and the sciences, especially the social sciences.

    Thales and Einstein were thinking  only about the physical sciences. But Thales’ insight is equally foundational and important for the social sciences. Yet the social sciences have shown not one 10,000th of the success of the physical sciences. Why is this? Think about it . . .

    The general answer is that there are a number of reasons why, but the most important reason is the inability of social science researchers to experiment.  In nonexperimental (also called observational) research, causal inferences are vastly more difficult. Social scientists are therefore very limited in their ability to build causal theories and social science research consumers are similarly limited in their ability to intervene appropriately. They are largely relegated to prediction, only.

    1. Can Classical Statistics Be Used to Draw Causal Inferences?

    Summary of Section 2: Neither frequentist nor Bayesian statistical  paradigms can be used to draw causal inferences because the word cause is not a part of their derivations.  These inquiring systems cannot reach conclusions about a concept that they have no knowledge of, like the mathematical discourse of Euclidean geometry cannot by itself draw conclusions about the apparent color produced by overlaying florescent blue and yellow, congruent triangles. The answer is that the resulting figure would appear to be a green triangle, but that result could only be attained from geometry, if geometry were extended to incorporate the color wheel or if a new color geometry were derived.

    Classical statistics can be considered to be an axiomatic  mathematical  discourse, as is Euclidean geometry. For purposes of this presentation frequentist and Bayesian statistics will be considered the components of classical statistics. Fiducial statistics will be disre- garded here because of its limited, current use.

    Andrey Kolmogorov (1903–1987) derived probability theory beginning with six ax- ioms which later mathematicians reduced to three. Building on the pure mathematical discourse of probability theory, every time you teach your basic statistics class, you do a simplified derivation of classical statistics. The resulting pure mathematical discipline of statistics can then be transformed into the two alternative  applied mathematical  branches of frequentist and Bayesian statistics by inserting two different definitions for the technical element of probability.

    An understanding of the relationships between Euclidean and non–Euclidean geome- tries will be helpful in comprehending the situation among Frequentist statistics, Bayesian statistics, and other potential statistical paradigms.  Does Euclidean geometry applied to everything?  No, it only applies to flat planes.  If you wanted to determine the number of degrees in a triangle  inscribed on a sphere the total number of degrees in the three angles would be greater than 180. Similarly, if you wanted to determine the number of degrees in a triangle inscribed on the inside surface of a sphere, like the inside of a world globe, the number of degrees would be less than 180. Various non–Euclidean geometries apply to these types of situations, but Euclidean geometry does not.

    A non–Euclidean geometry is one in which one or more fundamental  aspects, often a postulate, fundamental to Euclidean geometry, is altered. For example, one might alter the fifth postulate in Euclidean geometry, i.e., the famous parallel postulate, so that parallel lines do intersect, which does not happen on flat planes, but is something that happens on both inside and outside surfaces of a sphere.  Then,  using one of these modified  axiom sets, new and different  theorems can be derived. These altered theorems then apply to different  parts of the world (e.g., on convex surfaces) from the world of flat planes, for which Euclidean geometry is applicable.

    The point being that it is normal and typical for a given branch of applied mathematics, including its provable theorems, to be applicable to specific phenomena or specific aspects of the real world and not to others.

    The question at issue here is whether or not classical statistics can be employed to make causal inferences from either experimental or nonexperimental studies. The applied math- ematical discourses of frequentist and Bayesian statistics are perfectly capable of handling, for example, sample means or correlations  and inferring their values to populations, but, by themselves, mathematically incapable of drawing causal inferences from samples.

    This is because in the derivations of these classical  inquiring paradigms, the word

    ”cause” is never mentioned––not in the axioms, not in the technical terms or primitives, and not in the metalanguage (e.g., English). Therefore  causes and causal inference  can- not be a part of or a result of either one of these classical statistical paradigms, leading to the logically necessary conclusion that any statistical inquiring  system which could validly draw statistical causal inferences must be a paradigm distinct  from the classical statistical disciplines, i.e., an extension of classical statistics or a whole new paradigm.

    Do you find it interesting that causality is the most fundamental and most important concept in science and yet the classical statistical paradigms cannot handle it?  I find it shocking and almost incomprehensible!  That’s a huge void in the firmament of statistics.

    This is reminiscent of an old story which I will adapt for the situation.  A statistician and a wheelchair–bound  social science researcher were ambulating down a path in the rain forest and came upon a mango tree with ripe mangoes at the very top. The statistician went directly  to the tree and started pulling  off leaves. He then ate some of them and gave some to be researcher to eat. Puzzled, the social science researcher asked the statistician,  You know the leaves have little food value and taste terrible, why not climb to the top and pick the mangoes to eat. The statistician responded, That would be far better, but the leaves are so much easier to get and there is no risk of falling.

    Causal knowledge is far better, but correlations are much easier to get, there is no risk of failure, and parenthetically  they are accepted for tenure.

     

    1. So, How Have the Physical  Sciences Been so Successful at Drawing Causal Inferences?

    Summary of Section 3: The physical/experimental sciences have been overwhelmingly suc- cessful, by intuitively extending the classical paradigms by implicitly inserting causality and injecting unstated, but simple and generally acceptable, assumptions.

    Thales began science by claiming  that natural phenomena had natural causes, as op- posed to being caused by the gods. The physical/experimental  sciences have been over- whelmingly successful, by intuitively applying Thales’ insight. Yet few if any of the scien- tists (or statisticians for that matter) were conscious of the full explication of the mathemat- ics or statistics underlying their efforts. They were tacitly extending the classical statistics paradigms into a different paradigm that might be appropriately called experimental causal statistics. Their implicit processes and assumptions were typically  not questioned because they were so intuitively appealing and generally acceptable.

    The fact that scientists and statisticians are not cognizant of some of the logical founda- tions of their research activities  may sound incredible,  but it’s really not. Both Euclid and Einstein, in their derivations,  made intuitive and implicit assumptions that they were not consciously aware of at the time. In fact in Euclid’s case it was 21 centuries before anyone became aware of his implicit assumptions.

    To understand these intuitive and subconscious structures, one must at least allude to them explicitly. Analytically, one would begin with classical statistics and extend it with ad- ditional  mathematical processes. Cause would be inserted into the extended axiomization  as a technical  term;  its definition or interpretation would be specified using the metalan- guage (e.g., English);  a modern statement of Thales’ insight would be input as an axiom; and other needed assumptions or axioms concerning the experimental design would be inserted.

    The updated Thales axiom would note that all events have natural (i.e., non–mystical) causes and/or that all real correlations are in some way a result of causal mechanisms. This would lay the foundation for progressing from correlation (which is observable, measur- able, and calculable) to the inference of causation (which is none of those things).

    A typical  and generally acceptable experimental assumption would be that the exper- imenters’ action in manipulating  the putative cause is not correlated with other potential causes of the putative effect. An example counter to this assumption would be the exper- imenter who arises early every morning  and concludes that his Snap, Crackle,  and Pop breakfast food causes the cock to crow, when in fact the sunrise is the causal variable.

    Through this explicit, lengthened derivation, it is clear that the resulting applied math- ematical discourse is no longer a classical statistical  paradigm.  It is an expanded, more capable and far–reaching statistical inquiring system, arrived at via continued axiomiza- tion, beginning from the classical statistics discourse. This resulting paradigm, that I call experimental  causal statistics,  is capable of logically manipulating  and relating correla- tion, causation, and experimental data and, in particular, of drawing causal inferences from experimental data. It gives an explicit, logical foundation for the research which physi- cal/experimental  scientists have been carrying  out intuitively and implicitly over the past few hundred years.

    On the other hand, I don’t mean to imply that I have above actually  performed the extended derivation.  I have only been explicit about describing how it would be done. The actual extended axiomization  would, by itself, be a full paper and the proofs of important theorems could be another paper or so.

    Now, turning to nonexperimental/observational causal inference, that is a whole different kettle of fish.

     

    1. What about Causal Inference in the Social Sciences?

    Summary of Section 4: The social sciences are largely limited to conducting nonexperi- mental studies. These and other observational  sciences have had the opposite experience to that of the physical/experimental  sciences. The assumptions required for extension of classical statistics to draw causal inferences from social/observational studies, are volumi- nous, complicated, and anything but generally acceptable. Therefore the development of a new observational causal inquiring paradigm is the only reasonable path forward.

    One would hope that for the social and other observational sciences, one could do a similar thing, i.e., extend the classical statistics, to that which was done for the physi- cal/experimental  sciences implicitly and in this paper alluded to explicitly.  Unfortunately that turns out not to be the case.

    Obtaining knowledge in the social/observational  sciences is far more difficult at every level than obtaining knowledge in the physical/experimental  sciences. There are far more variables involved in almost any social science situation. Typically the variables are far less precise; for example confidence may originate from intellectual performance or athletic performance. But, of all the difficulties  faced by the social sciences, the most devastating is their general inability to experiment.

    This inability to experiment makes drawing causal inferences almost impossible. Cer- tainly the comparatively simple extension to classical statistics that worked for the physical sciences has not and will not work for the social sciences, epidemiology,  and other nonex- perimental disciplines. For social scientists to have any general chance of drawing causal connections from their observational studies, a completely new causal inquiring  paradigm would have to be developed, derived.

    The difficulties have dissuaded most statisticians and social scientists from even at- tempting observational causal inference, although many have chosen, inappropriately,  to come down in the middle by using ambiguous synonyms like leads to, results in, yields, etc. All this does is add to the confusion.

    A few of the best statisticians have, during the last 100 years, perceived the need and tried to appropriately fill it. They attempted to do largely  as the physical scientists did, i.e., intuitively and more or less implicitly extend classical statistics to draw causal connections from nonexperimental data.

    Unfortunately  these efforts,  although receiving  a certain amount of acclaim in their time, have, in the end, been generally unsuccessful. We know that these efforts have been unsuccessful because, if any one of them was truly capable of, in a generalized and reason- ably applicable way, drawing  observational causal inferences, the response would’ve been sensational.  Any paradigm truly capable of that would revolutionize  the social and other nonexperimental  sciences and that hasn’t happened.

    Further and surprisingly,  there is now empirical  evidence that these approaches have not worked. In 2011, Stanley Young and Alan Karr analyzed 52, effectively randomly selected, published  studies which  had reported success in their efforts to draw nonexper- imental causal inferences through the utilization an extension of classical statistics.  The findings of Young and Karr were presented in Significance under the title, ”Deming, Data and Observational  Studies: a Process Out Of Control and Needing Fixing.”

    In the lead up to the paper, the editors wrote:

    Any claim coming from an observational study is most likely to be wrong. Startling, but true. Coffee  causes pancreatic  cancer. Type A personality causes heart attacks. Transfat is a killer. Women who eat breakfast cereal give birth to more boys. All these claims come from observational studies; yet when the studies are carefully examined, the claimed links appear to be incorrect.  What is going wrong? Some have suggested that the scientific method is failing, that nature itself is playing tricks on us. But it is our way of studying nature that is broken and that urgently needs mending  . . . .

    Specifically,  at the end of their analysis, they concluded that not even one of the 52 ”dis- covered” nonexperimental causal conclusions was borne out by experimental replication.  Further, to add insult to injury,  in five of the cases, experimental findings yielded significant causal inferences in the opposite direction to those found by intuitive and implicit extension of classical statistics.

    The natural question at this point is, why would an extension to classical statistics work so well for the physical  sciences be so powerless when applied to the social and other nonexperimental sciences? The answer is that the experimenter  acts as a touchstone, over repeated runs limiting the causing variables to one, the one manipulated  by the experi- menter. In the nonexperimental case there is no such touchstone; any variables, even those not considered by the study, can be the cause or causes of a change in the putative effect variable.  This is analogous to trying to solve an algebraic equation of 10 unknowns with only one equation. There are logical ways around this, but they are complicated and require questionable assumptions. It’s not impossible it’s only almost impossible.

    This is a shocking and almost inconceivable finding and is testatory of the noxiousness of the causal–theory–construction difficulties faced by social, health, and other observa- tional scientists.  As Young and Karr mildly, but pointedly,  note in their title, this needs fixing.

    To their credit they proposed  a fix, but even if their suggestions were scrupulously followed,  the outcome would be beneficial,  but far from a complete  fix. I dealt with this issue some time ago and the next Section is a report on the development and the results of that attempted fix.

    1. A Proposed Fix

    Summary of Section 5: This Section reports on the development of a new statistical paradigm for inferring causal connections from observational studies. It outlines the philosophical and technical investigations, the logic which led to the conclusion that an axiomization was required, and the derivation of the new inquiring  paradigm. The derivation and accompa- nying information  and explanations are about 300 pages long, necessitating the summa- rization reported in this Section.



    ------------------------------
    C. Sterling Portwood, PhD
    Causal Statistician
    Center for Interdisciplinary Science
    ------------------------------




  • 21.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-27-2015 09:53



    ------------------------------
    Maria Shubina
    ------------------------------
    |  If you wanted to determine the number of degrees in a triangle  inscribed on a sphere the total number of degrees

    |  in the  three angles would be greater than 180.

    Agree, sphere has positive curvature.

    |   Similarly, if you wanted to determine the number of degrees in a triangle inscribed on the inside surface of a sphere,

    |   like the inside of a world globe, the number of degrees would be less than 180.

    Do not agree: inside of a sphere is just  the same as its outside.

    To get a triangle with the number of degrees < 180 you need to consider a surface of negative curvature, like hyperboloid.





  • 22.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-29-2015 20:23

    Yes Maria, you are right.  Thank you for that correction.

    ------------------------------
    C. Sterling Portwood, PhD
    Causal Statistician
    Center for Interdisciplinary Science
    ------------------------------




  • 23.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 09-02-2015 00:02

    I have received a few insightful comments and questions on my original response to Sam Cook’s question, including from my son.  One of the insights that I gleaned from the comments was a renewed and deeper understanding of how people can read the same word or set of words and internalize different meanings.  Incredibly, the initial words that needed more detailed clarification were “mathematics” and “statistics.”  Given that understanding, I thought it might be worthwhile and interesting for me to answer the original question anew, with more precise specifications of definitions and greater precision of explanation.

    That task sounded simple enough, but it turned out to be anything but.  In interpreting and parsing the word “mathematics,” it turns out that “applied mathematics” and “pure mathematics” each mean different things in different contexts and to different people.  I have used the phrases many times myself without confusion because everyone in the conversation understood the context in which the phrase was being used.  But trying, in this write up, to explain those phrases, their various meanings in different contexts, and their apparent conflicts got very complicated and caused me to back up, start again, and leave out interpretations that weren’t absolutely essential to making my points concerning the original question of the relationship between mathematics and statistics.

    Before getting to the answer to Sam Cook’s question, I need to clarify a number of definitions and understandings about the terms “mathematics” and “statistics.”

    The coarsest level of dissection of the word “mathematics” is the common dichotomy between “applied mathematics” and “pure mathematics.”  Examples of “applied mathematics” utilized at this common level are arithmetic, algebra, trigonometry, and calculus.  Examples of “pure mathematics” employed at this level of coarseness are Euclidean geometry, abstract algebra, analysis, and topology.  

    My first cut definition of “applied mathematics” is the development and use of mathematical models to reach or to prove logical predictions or implications.  For example, beginning with a mathematical model or function, say concerning profit, the maximum profit point can be predicted by differentiation of the functional model.

    In the common usage, “pure mathematics” is concerned with axiomatization of subfields of mathematics, like Euclidean geometry.  Euclidean geometry begins with assumptions (i.e. postulates or axioms), technical terms, and definitions (like the meaning of congruence).  Technical terms are normally of three types: elements (like point or line), relationships among the elements (like greater than or equal to), and operations that might be performed on the elements (like addition or multiplication).  “Pure mathematics” is also concerned with the logical derivations of theorems within the discourse, utilizing the aforementioned beginnings.  All of these things, aggregated together, form a branch or discourse of pure or axiomatic mathematics, e.g., Euclidean geometry.  Such packaged axiomatizations are referred to as a discourse, a field, a subfield, a discipline within or a subset or a piece of pure or axiomatic mathematics.  

    This common distinction between the two phrases are different from the definitions and distinction employed by research mathematicians.  Fortunately, it turns out that I can make my points without drawing on the usages of research mathematicians.  So to avoid additional complication and confusion, I will not utilize the definitions and distinction employed by research mathematicians.

    Virtually all discourses in mathematics can be both applied and pure, depending on what forms they take and how they are used and/or viewed.  For example, an early mathematician (prior to the time of the Greek philosophers), say from Egypt or Mesopotamia, might have arrived at the Pythagorean theorem or formula through experimentation and inductive logic and then applied that formulation to the building of pyramids or houses.  This would be considered applied geometry.  On the other hand, Euclidean geometry, beginning with postulates, etc., would be considered a discipline in pure or axiomatic mathematics.

    Now, what about “statistics?”  Like most discourses, statistics can be considered to be a branch of applied mathematics or, if properly axiomatized, a discipline of pure mathematics.  Most people probably think of the field of statistics simply as an applied mathematical construct, with no reference to axiomatization. 

    I think of statistics as an axiomatic subsystem of mathematics, beginning with Andrey Kolmogorov’s (1903–1987) axiomatization of probability theory.  Then, every time you teach your basic statistics class, you build on the pure mathematical discourse of probability theory and perform a simplified derivation of classical statistics.

    The resulting pure mathematical discipline of statistics can then be transformed into the two alternative axiomatic mathematical branches of frequentist and Bayesian statistics by inserting two different definitions for the technical element of probability.

    The goal of statistics is to obtain knowledge by interrogating the world, mostly via data.  That’s why statistical discourses are referred to as inquiring systems.  With all of this background, it may now be possible to more easily understand my single sentence answer to Sam Cook’s question concerning the relationship between statistics and mathematics:

    "Frequentist and Bayesian statistics can be considered to be axiomatic mathematical discourses, ultimately becoming inquiring paradigms."

    If you understand this one sentence, then you understand what I considered to be the relationship between statistics and mathematics.  If the meaning of that one sentence is still unclear, then you may want to reread the excerpt, previously presented, from my paper to comprehend, based on the definitions presented above, what I see as the connection between statistics and mathematics.



    ------------------------------
    C. Sterling Portwood, PhD
    Causal Statistician
    Center for Interdisciplinary Science
    ------------------------------




  • 24.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 09-08-2015 00:05

    As I was reading over some of the responses sent to me concerning my view of the relationship between statistics and mathematics, I recognized a strain of thinking, among some, that I felt deserved refutation.  But first I must define my terms (elements) clearly, delineate relationships, etc.; almost like an axiomatization. 

    I refer to the full realm of statistics variously as panoptic, generalized, or global statistics. Panoptic statistics could be thought of as a collection of probability-based inquiring systems. It is composed of two or three inquiring, mathematical discourses, viz. frequentist, Bayesian, and maybe fiducial statistics.

    The view that I object to, is that the field of global statistics is generally well defined and delimited, i.e., that generalized statistics is situated within a box with impenetrable sides, but a box with a more or less open bottom.  By that, I mean that most statisticians see the opportunities for research and expansion of knowledge within panoptic statistics to be in plumbing the depths within frequentist, Bayesian, and fiducial statistics and not in escalating the number of discourses or subfields making up global statistics.  That’s why the bottom of the box is more or less open and the sides are impenetrable. 

    This wrongheaded mindset that I speak of is the belief that horizontal expansion of generalized statistics to encompass or create additional inquiring paradigms is verboten or, worse yet, that no other such statistical discourses or subfields exterior to the box can exist.  My take is that this is an artificial mental limitation connected with global statistics, which places a stranglehold on expansive research in the field.  One important subset of that limitation is the restriction of panoptic statistics from dealing with issues beyond means, correlations, etc.

    As mentioned in previous postings on this topic, I am working on the development of a new statistical discourse called observational causal statistics.  The issues in that subfield are different from those inside the box.  These issues are observational data, causation, causal connections, and causal inference.  Under the limited vision of generalized statistics, such research would be impossible to carry out under the rubric of statistics. 

    Looking back historically for insights, until not too long ago, frequentist statistics was the only accepted paradigm within generalized statistics.  The same mindset of limitation and exclusion caused a great deal of conflict over the admission of Bayesian statistics to the party, but now that battle is generally regarded as won.  Fiducial statistics is interesting, but seldom used and, therefore, not worth spending time on, here.

    Another objection that often arises, both stated and unstated, is the homily that we all learned at our professors’ knees, “Correlation does not imply causation.”  That statement is accepted by almost all statisticians, yet it is more complicated than that.  The quote seems incompatible with my field of research.  What’s going on here?  Is this statement true or is it false?

    The proper answer is that the quotation is partially right and partially wrong.  It is right if you limit your thinking to “the box,” were classical statistics resides.  In fact, maybe a better way to say it is that the quote is meaningless inside the box, because none of the three paradigms in the box know what causality is.  It’s sort of like talking about the Euclidean geometry of blue and yellow triangles.  Euclidean geometry doesn’t deal with color, therefore any discussion of color in relationship to Euclidean geometry is meaningless. 

    But if you can give your mind permission to stray outside the box, then it might be clear to most statisticians/mathematicians that a different, inquiring subfield (i.e., observational causal statistics) might make causal inferences possible.  And the old adage that “Correlation does not imply causation” might be incorrect there.

    Many of the sharpest statisticians throughout the history of the field have recognized the centrality of causal inference to the sciences and have attempted to develop techniques for drawing non-experimental causal inferences.  They tried to establish causal connections through the utilization of the paradigms of classical statistics or intuitive extensions to those discourses.

    Even with all of the high-level brainpower brought to bear, these efforts have been generally unsuccessful (see Young and Karr, Significance, 2011).  I would argue that there difficulties arise from the fact that they stayed in the box or very close to it, when they should have broken out of the box and developed a new and original axiomatic paradigm designed for the express purpose of making observational causal inferences.



    ------------------------------
    C. Sterling Portwood, PhD
    Causal Statistician
    Center for Interdisciplinary Science
    ------------------------------




  • 25.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-21-2015 10:04

    This isn't a reference to an "accessible reading"; it's a story that relates to the topic.

    When I took my final orally given exam for my MS in Applied Statistics, the department was called "Mathematics and Statistics".  The professor who taught most of the theory courses told me to imagine a situation wherein something was distributed according to a simple and well-known distribution.  (It think it was binomial but this occurred many years ago.)  He then asked me to determine the conditional distribution given a second piece of information.  That made Bayes' Theorem the obvious thing to use.  After I solved the problem I stood there staring at the whiteboard and he asked me what I was doing.  When I said, "I'm trying to understand why the result is Poisson-distributed", he could not understand what I meant.  The point here is that I was looking for an intuitive understanding of the result whereas he saw no need for an intuitive understanding of something that had been formally proven.
    ------------------------------
    Emil M Friedman, PhD
    emil.friedman@alum.mit.edu (forwards to day job)
    emilfrie@alumni.princeton.edu (home)
    http://www.statisticalconsulting.org
    ------------------------------




  • 26.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 08-31-2015 11:55

    I have written a book entitled "Random Tales of a College Math Professor, Over-Easy, on Wry," that provides examples of mathematics and statistics applications - some typical and some odd - that occurred during my career of teaching, consulting, and expert-witnessing.  The author's name is Mick Norton, the name I am called by those who know me.

    ------------------------------
    Robert Norton
    ------------------------------




  • 27.  RE: Accessible Reading on Statistics vs Mathematics

    Posted 09-05-2015 11:07

    Sam,

    Here is a clarifying problem-based view of statistics, which is in the May/June 2015 issue of Analytics Magazine, http://goo.gl/Wod3gk

    ------------------------------
    Randy Bartlett
    ------------------------------