Dear Giovanni,
I am teaching on Big Data in an industrial environment. From my point of view, the basic statistical concepts apply and you should continue to teach those to form the mindset a data analyst requires.
However, when it comes to big data, automated analysis of data quality and reproducibility of results becomes ever more important. I'd suggest to include this topic explicitely in your lecture.
For a start I recommend the following articles:
Puts et al.: Finding errors in Big Data: Finding errors in Big Data - Puts - 2015 - Significance - Wiley Online Library
Wiley |
remove preview |
|
Finding errors in Big Data - Puts - 2015 - Significance - Wiley Online Library |
Beate Franke, Jean-François Plante, Ribana Roscher, En-shiun Annie Lee, Cathal Smyth, Armin Hatefi, Fuqi Chen, Einat Gil, Alexander Schwing, Alessandro Selvitella, Michael M. Hoffman, Roger Grosse, Dieter Hendricks, Nancy Reid, Statistical Inference, Learning and Models in Big Data, International Statistical Review, 2016 Wiley Online Library |
View this on Wiley > |
|
|
Wickham, Hadley: Tidy Data. Journal of Statistical Software - see http://www.jstatsoft.org/
Also Roger D. Peng (Biostatistics) has published on this topic.
Best regards,
Christian
------------------------------
Christian Graf
Dipl.-Math.
Qualitaetssicherung & Statistik
"To call in the statistician after the experiment is done may be no more than asking him to perform a post-mortem examination: he may be able to say what the experiment died of."
Ronald Fisher in 'Presidential Address by Professor R. A. Fisher, Sc.D., F.R.S. Sankhyā: The Indian Journal of Statistics (1933-1960), Vol. 4, No. 1 (1938), pp. 14-17'
Original Message:
Sent: 08-18-2016 16:02
From: Giovanni Petris
Subject: Suggestions for teaching Mathematical Statistics
Dear All,
I have been teaching a Masters level course in Mathematical Statistics for several years out of Casella and Berger's textbook. The syllabus includes optimal estimators (Cramer-Rao, Rao-Blackwell, Lehmann-Scheffe), hypothesis testing, confidence intervals, convergence. However, it seems to me that the material is a bit outdated in the age of big data, sparse estimation, and multiple testing - especially at the Masters level. Therefore I was considering updating the course content.
I would like to know if anybody out there feel the same and if you have any suggestions for possible topics to include and to drop. Suggestions on textbooks on a modern version of Mathematical Statistics are very welcome too.
Thank you all in advance for the feedback!
Best,
Giovanni Petris
------------------------------
Giovanni Petris, PhD
Professor
Director of Statistics
Department of Mathematical Sciences
University of Arkansas - Fayetteville, AR 72701
------------------------------