Big Data Sessions at JSM

By Steve Pierson posted 07-30-2013 19:41

  

[4/13/14: NB, see Big Data Sessions at 2014 Joint Statistical Meetings.

6/22/15: see Big Data Sessions at the 2015 Joint Statistical Meetings]

As I was going through the JSM 2013 Program Book, I noticed many Big Data sessions and thought it would be useful to list them out, which I do below. If I missed you session or a talk, send me an email or, better yet, add it in the comment space below. My list isn't meant to be exhaustive; I mostly looked through invited and contributed session titles (not poster sessions, roundtables, ...)

See also these previous blog entries relating to big data:

[10/25/13 See also this subsequent blog post:

 

Big Data Sessions at JSM

  1. Statistical Computing for Big Data    CE_07C Sun, 8/4/2013, 8:30 AM - 5:00 PM W-Ville-Marie 
  2. Emerging Statistical Methods for Big Data     5 * ! Sun, 8/4/2013, 2:00 PM - 3:50 PM CC-710b   
  3. Spatial Statistics for Big Environmental Data Sets   6 * ! Sun, 8/4/2013, 2:00 PM - 3:50 PM CC-520e
  4. New Techniques for Big High-Dimensional Data   50 Sun, 8/4/2013, 4:00 PM - 5:50 PM CC-511b
  5. Recent Activity in Big Data: Curriculum Development and Funding Opportunities    58 * Sun, 8/4/2013, 4:00 PM - 5:50 PM CC-510a
  6. New Statistical Methodologies for Sampling and Inference with Big Data    73 Sun, 8/4/2013, 4:00 PM - 5:50 PM CC-525a
  7. JABES Showcase: Modern Dimension-Reduction Methods for Big Data Problems in Ecology 104 * Mon, 8/5/2013, 8:30 AM - 10:20 AM CC-511a
  8. From Real Time to Long Term: Applications of Big Data in Sports   107 * Mon, 8/5/2013, 8:30 AM - 10:20 AM CC-519a
  9. Innovative Bayesian Methods for Big, Complex Object Data Sets    118 Mon, 8/5/2013, 8:30 AM - 10:20 AM CC-520a
  10. Statistical Computing with Big Data  180 * ! Mon, 8/5/2013, 10:30 AM - 12:20 PM CC-520a
  11. Nonparametric Methods for Big Data    188 Mon, 8/5/2013, 10:30 AM - 12:20 PM CC-512d
  12. Visualizing Big Data Interactively      217 * ! Mon, 8/5/2013, 2:00 PM - 3:50 PM CC-510b
  13. Toward Big Data in Teaching Statistics   210 * ! Mon, 8/5/2013, 2:00 PM - 3:50 PM CC-510c  
  14. The Secret Weapon of the Dark Knight Against the Joker: Statistical Methods for Big and Massive Data Sets 328 * ! Tue, 8/6/2013, 10:30 AM - 12:20 PM CC-516d
  15. Introductory Overview Lecture: Big Data   392 Tue, 8/6/2013, 2:00 PM - 3:50 PM CC-710a
  16. Taming Big Data with Matrix and Tensor Decomposition Methods   398 Tue, 8/6/2013, 2:00 PM - 3:50 PM CC-520b
  17. Big Data Exploration with Amazon   514 ! Wed, 8/7/2013, 10:30 AM - 12:20 PM CC-516c
  18. Transitioning to Big Data: What Every Statistical Programmer/Analyst Should Know 573 * Wed, 8/7/2013, 2:00 PM - 3:50 PM CC-519a
  19. Big Data, Big Impact When Statistics Matter   663 * ! Thu, 8/8/2013, 10:30 AM - 12:20 PM CC-512ab

8/2/13 Addition: I noticed a presentations (poster and otherwise on data science that look interesting:

  1. The Emerging Role of the Data Scientist — Charles D. Kincaid, Experis Business Analytics     92 Sun, 8/4/2013, 8:30 PM - 10:30 PM CC-517cd
  2. Speaking Clearly About Data Scientists: A Survey and Clustering Analysis — Harlan D. Harris, Data Community DC ; Marck Vaisman, Data Community DC ; Sean P. Murphy, Data Community DC  92 Sun, 8/4/2013, 8:30 PM - 10:30 PM CC-517cd
  3. The Data Scientist Degree: A Necessity for Growth in Our Discipline`
  4. Precursors to the Data Explosion: Teaching How to Compute with Data          

I also did an Abstract Keyword Search for "big data" and got the following extensive list (I'm assuming some of the following talks are part of the sessions above):

Sunday

Nonparametric Bayes Multi-Task Multi-View Learning
Angela Schoergendorfer, IBM T.J. Watson Research Center; Hongxia Yang, IBM T.J. Watson Research Center


Capacity-Building in the Era of Big Data
Sastry G. Pantula, NSF
2:05 PM

BigVis: Visualizing Large Data in R
Hadley Wickham, RStudio
2:55 PM

Bayesian Analysis of Spatial Transformation Models with Applications in Neuroimaging Data
Michelle Miranda; Hongtu Zhu, UNC-Chapel Hill; Joseph G. Ibrahim, UNC
3:20 PM

Long Live (Big Data-Fied) Statistics!
Norman S. Matloff, The University of California, Davis
4:05 PM

Ensemble Learning for Big Data
Hugh A. Chipman, Acadia University; Robert E. McCulloch, The University of Chicago Booth School of Business; Matthew Pratola, Simon Fraser University; Dave Higdon, Los Alamos National Laboratories; James Gattiker, Los Alamos National Laboratories; Steven L Scott, Google
4:05 PM

Recent Activity in Big Data: Curriculum Development and Funding Opportunities
Nandini Kannan, NSF; Michael Rappa, NCSU; Bill Howe, University of Washington; Michelle Christine Dunn, National Cancer Institute
4:05 PM

Computational Strategies in Regression of Big Data
Ping Ma, University of Illinois at Urbana-Champaign
4:30 PM

Losing $3 Million and Being Happy: A Tale of Money, Lives, and Prediction
Bruce Swihart, Johns Hopkins School of Public Health; Ciprian M. Crainiceanu, The Johns Hopkins University; Brian Caffo, Johns Hopkins University; Rafa Irizarry, JHSPH; Yingying Wei, JHSPH; Jeff Goldsmith, Columbia University; Russell Shinohara, Univ of Pennsylvania; Gagan Sidhu, University of Alberta
4:35 PM

Programming with Big Data in R
George Ostrouchov, Oak Ridge National Laboratory; Wei-Chen Chen, Oak Ridge National Laboratory; Drew Schmidt, University of Tennessee; Pragneshkumar Patel, University of Tennessee
4:55 PM

Monday, 08/05/2013
Statistics for Spatio-Temporal Data: New Challenges
Christopher K. Wikle, University of Missouri
7:01 AM

Ecological Prediction with Nonlinear Multivariate Time-Frequency Functional Data Models
Christopher K. Wikle, University of Missouri; Wen-Hsi Yang, University of Missouri; Scott H. Holan, University of Missouri; Mark L. Wildhaber, U.S. Geological Survey
9:50 AM

Bayesian Object Regression for Complex, High-Dimensional Data
Jeffrey S. Morris, The University of Texas MD Anderson Cancer Center; Veera Baladandayuthapani, The University of Texas MD Anderson Cancer Center
9:55 AM

Making Rules Human-Interpretable for Alarm Prediction in Sensor Network
Hongfei Li, IBM T. J. Watson Research; Buyue Qian, UC Davis; Dhaivat Parikh, IBM GBS; Arun Hampapur, IBM Research
10:35 AM

MCMC and the Bias-Variance Tradeoff
Anoop Korattikara Balan, University of California; Yutian Chen, University of California, Irvine; Max Welling, University of Amsterdam
11:05 AM

SeqArray: An R/Bioconductor Package for Big Data Management of Genome-Wide Sequencing Variants
Xiuwen Zheng
11:05 AM

Estimating Average Proportional Changes in Large, Sparse Data
Ryan Giordano
11:50 AM

Ticks, Tweets, and Trails of Pain: Some Examples of Big Data in Business Research
James G Scott, The University of Texas at Austin
12:32 PM

Web-Based Interactive Graphics for Big Data
Simon Urbanek, AT&T Labs
2:05 PM

Reflection of Statistical Sciences: Past, Present, and Future---Celebration of the COPSS 50th Anniversary
Bernard Silverman, St Petere's College, University of Oxford; Norman Breslow, University of Washington; Rob Tibshirani, Stanford; Nancy Reid, University of Toronto; Donald B. Rubin, Harvard University; Kathryn Roeder, CMU
2:05 PM

Introducing Science Students to Big Data
Randall Pruim, Calvin College; Daniel Theodore Kaplan, Macalester College; Elizabeth Shoop, Macalester College
2:25 PM

Precursors to the Data Explosion: Teaching How to Compute with Data
Nicholas J. Horton, Smith College; Benjamin S. Baumer, Smith College; Daniel Theodore Kaplan, Macalester College; Randall Pruim, Calvin College
2:45 PM

Big Data: Does the Song Remain the Same?
Chris J. Wild, University of Auckland; Antony Unwin, IUniversity of Augsburg
3:05 PM

Tuesday, 08/06/2013
Research Questions and Data Resources in Transportation Statistics
Rolf Schmitt, Bureau of Transportation Statistics; David Banks, Duke University; Alan F. Karr, National Institute of Statistical Sciences; Clifford H. Spiegelman, Texas A & M University
8:35 AM

Variable Selection for Big Data via Bagging Adaptive Lasso and Precision Shrinking
Cory Lanker, Iowa State University of Science and Technology; Wen Zhou, Iowa State University; Max Morris, Iowa State University; Stephen Bruce Vardeman, Iowa State University; Huaiqing Wu, Iowa State University
11:20 AM

Using Hidden Markov Models to Identify Job Seekers from Social Network Data
Peter Ebbes, HEC Paris; Oded Netzer, Columbia University
11:35 AM

Bayesian Manifold Learning
David B. Dunson, Duke University
11:50 AM

The Role of Bayesian Analysis for an Emerging Class of Complex Data: Object Data
Jeffrey S. Morris, The University of Texas MD Anderson Cancer Center
12:30 PM

The Practical Aspects of Doing Statistics on Large Data Sets
Joseph Rickert
12:31 PM

The Challenges and Opportunities for Statisticians in RFID-Sensed Big Data
Heungsun Park, Department of Statistics, Hankuk University of Foreign Studies; Hyunsoo Kim, Kyonggi University
2:05 PM

Divide and Recombine (D&R) with RHIPE for Large Complex Data
William S. Cleveland, Purdue Universith
2:55 PM

Wednesday, 08/07/2013
Engineering Scientific Solutions
Yuliya Torosjan, Simulmedia; Krishna Balasubramanian, Simulmedia


Introduction: Statistical Visualization of Data and Process Structure
Aparna V. Huzurbazar, Statistical Sciences Group, Los Alamos National Laboratory
8:35 AM

The Future Is Now: Preparing Marketing Analytics Professionals for the New Age of Data
David Schweidel, Goizueta Business School, Emory University; Slavi Samardzija, KBM/Wonderman; Elea Feit, Wharton Customer Analytics Initiative; Marianna Dizik, Google, Inc.; Chris Mehrabi, newBrandAnalytics; Manila Austin, Communispace Corp.
8:35 AM

Analysis of Large Survey Data Sets Using Dynamically Generated SQL
Thomas Lumley, University of Auckland
9:15 AM

Challenges for Industrial Statisticians and Data Scientists
Winson Taam
9:35 AM

Bayesian and Frequentist Issues in Large-Scale Inference
Bradley Efron, Stanford University
10:35 AM

Linear Regression on 1 Terabytes of Data? Some Crazy Observations
Hesen Peng, Amazon.com
10:35 AM

A Regularized Regression for Large-Scale Online Advertising
Li Qin, Amazon
10:55 AM

Big Programs and the Use of High-Performance Computing
Natalie Cheung Hall, Eli Lilly and Company
2:05 PM

Scaling SAS Software from Small to Big Work
Jared L. Dean, SAS Institute
2:30 PM

Using Big Data for Practical Work
Nancy J. Petersen, Department of Veterans Affairs
2:55 PM

Thursday, 08/08/2013
Recent Advances in Claims Data--Based Total Health Care Cost Prediction
Donghui Wu, Elsevier/MEDai Inc.; Emad El-Sebakhy, Elsevier/MEDai Inc.; Krassimir Latinski, Elsevier/MEDai; Jun Han, MEDai, a LexisNexis Company; Ognian Asparouhov, Elsevier/MEDai Inc.
8:35 AM

 

1 comment
314 views

Comments

07-31-2013 10:08

Thank you, I find this quite helpful!