Spring Roundtable: Big Data!
Speaker: Dr. Yoav Freund
Topic: “Big Data and Statistics, A Better Union?”
Date: April 22, 2014
Time: 6-8:30 pm
Location: Pfizer, La Jolla
Food and drink will be served. Register with Vivian, let her know how many people are coming and if any are vegetarians. There is no cost to you to attend.
At the beginning of the 90's, data sets with 10,000 instances and 100 dimensions were considered "big". In 2014, data sets with a million examples and a million dimensions (sparsely populated) are common. This massive change in scale requires a reformulation of the basic tools of data analysis. On the data management side we have hdfs/hadoop, column based databases and distributed databases. On the statistical analysis level we have stochastic gradient descent, support vector machines, boosting and compressed sensing. Much progress has taken place on each side of the statistics / data management fence but integration of the two sides is still lagging. In this talk I will briefly review the current state of affairs and describe some of the work taking place in UCSD.
Contact Patricia at 858 952 4396 the day of the event for directions or other issues.
Yoav Freund is a professor of Computer Science and Engineering at UC San Diego. His work is in the area of machine learning, computational statistics and their applications. Dr. Freund is an internationally known researcher in the field of machine learning, a field which bridges computer science and statistics. He is best known for his joint work with Dr. Robert Schapire on the Adaboost algorithm. For this work they were awarded the 2003 Gödel prize in Theoretical Computer Science, as well as the Kanellakis Prize in 2004. His webpage can be found here.