I gave a talk last week to the Detroit ASA section on "Is Reproducibility Even a Possibility?". I discussed what happens when you use a different random seeds to make your models. I chose to look at Logistic Regression and Decision Trees. With each random seed, you will find some terms are "significant" or "important" for all the models you make, no matter the random seed. Other terms will pop up as being significant or important only for a single or a few of those random seeds. So, trying to hyper-tune parameters, using "Deep Learning", etc, is pretty useless. Because, each model is more of an opinion, than a precise model of what the data tells us. Since each model finds different terms are important, you'll "tune" or "Deep Learn" some "right" terms and some "wrong" terms.
The error rates of each model made, with each random seed, are usually about the same. But, the different opinions each model gives about what is truly important will change each time.
Forgo the parameter tuning. Use an ensemble of results from whatever type of model you use. Just make sure you change the random seed each time.
https://www.youtube.com/watch?v=sYPvCE_au4Q
------------------------------
Andrew Ekstrom
Statistician, Chemist, HPC Abuser;-)
------------------------------