Well for the sake of the weekend, there is an old joke among astrophysicists and the rate of paper publication on dark matter, string theory or whatever topic you want to mock. It can easily be applied to Big Data. And it includes - as most jokes do - an important truth:
"Imagine all new data created in all connected networks (aka internet) as a growing pile of paper. The velocity of its growth has already exceeded c - or at least will do so soon enough.
Why is this not a contradiction to the theory of relativity?
- Simple: The information added with each new dataset is zero."
Best regards and a creative (or lazy - whicherver you prefer) weekend,
Christian
PS.: The challenge in 'Big Data' is often to pick the information from a huge pile of mostly redundant or unimportant stuff and not to fall for the noise. And yes - 'machine learning' helps a great deal with that. However, when I was still a student, we simply called it 'numerics' or 'applied mathematics' ;-).
------------------------------
Christian Graf
Dipl.-Math.
Qualitaetssicherung & Statistik
"To call in the statistician after the experiment is done may be no more than asking him to perform a post-mortem examination: he may be able to say what the experiment died of."
Ronald Fisher in 'Presidential Address by Professor R. A. Fisher, Sc.D., F.R.S. Sankhyā: The Indian Journal of Statistics (1933-1960), Vol. 4, No. 1 (1938), pp. 14-17'
------------------------------
Original Message:
Sent: 12-08-2017 09:35
From: Kelly Zou
Subject: "Big data is dead. Data is just data."
Original Message------
Is it possible that "Big Data" was replaced with "Machine Learning" and "Data Science" because those are the types of skills needed to analyze "Big Data"?
For anyone that thinks data is just data, try working at Ford on autonomous cars or at U of Michigan health system taking signals from medical equipment. Both require taking live data from multiple streams of sensor arrays or arrays of sensors, and determining, "Is this data good?, If so, then what?" Get it right, no one knows or cares and things get better. Get it wrong, people die.
Having worked with this type of data, it's not uncommon for each sensor to send 100Hz to 1,000Hz of signal data times X number of sensors times Y hours.... I think that gets pretty big. Thankfully, there is Machine Learning.
------------------------------
Andrew Ekstrom
Statistician, Chemist, HPC Abuser;-)
------------------------------