Group Stratified Shuffle Split cross validator for data science projects
Updated Mar 30, 2020 - Jupyter Notebook
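As a rough illustration of what a group stratified shuffle split might do, the sketch below keeps all samples of a group on the same side of the split while sampling groups per class. It is not the repository's code: the function name `group_stratified_shuffle_split`, its parameters, and the rule of labelling each group by its most common class are all assumptions made for this example, and only the Python standard library is used.

```python
import random
from collections import Counter, defaultdict


def group_stratified_shuffle_split(y, groups, test_size=0.25, n_splits=5, seed=0):
    """Yield (train_idx, test_idx) index lists.

    Hypothetical helper for illustration only; not a scikit-learn API.
    """
    rng = random.Random(seed)

    # Collect the sample indices that belong to each group.
    members = defaultdict(list)
    for i, g in enumerate(groups):
        members[g].append(i)

    # Assumption: a group's label is the most common class among its samples.
    by_label = defaultdict(list)
    for g, idx in members.items():
        label = Counter(y[i] for i in idx).most_common(1)[0][0]
        by_label[label].append(g)

    for _ in range(n_splits):
        test_groups = set()
        for label_groups in by_label.values():
            shuffled = label_groups[:]
            rng.shuffle(shuffled)
            # At least one group per class goes to the test side, so a class
            # with a single group would be absent from the training side.
            n_test = max(1, round(test_size * len(shuffled)))
            test_groups.update(shuffled[:n_test])

        train_idx = sorted(i for g, idx in members.items()
                           if g not in test_groups for i in idx)
        test_idx = sorted(i for g, idx in members.items()
                          if g in test_groups for i in idx)
        yield train_idx, test_idx


# Toy data: 8 groups of 2 samples each; even groups are class 0, odd are class 1.
groups = [g for g in range(8) for _ in range(2)]
y = [g % 2 for g in groups]

splits = list(group_stratified_shuffle_split(y, groups, test_size=0.25,
                                             n_splits=3, seed=42))
for train_idx, test_idx in splits:
    print(len(train_idx), len(test_idx))
```

Because groups are sampled independently on each iteration, the same group can appear in the test side of more than one split, as with any shuffle-split scheme.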
It is from Kaggle competitions where the training dataset is very small and the test dataset is very large, so we have to avoid or reduce overfitting, one of the most common problems in predictive analytics.
We use the Iris dataset bundled with scikit-learn to demonstrate the basics of machine learning.
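A minimal sketch of how such a demonstration could look, assuming scikit-learn is installed. The choice of `LogisticRegression`, five splits, and a 30% test fraction are illustrative assumptions, not details taken from the repository.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import ShuffleSplit, cross_val_score

# Load the Iris features (150 x 4) and class labels (3 species).
X, y = load_iris(return_X_y=True)

# Five random train/test partitions, each holding out 30% of the samples.
cv = ShuffleSplit(n_splits=5, test_size=0.3, random_state=0)

# Illustrative model choice; max_iter is raised so the solver converges.
model = LogisticRegression(max_iter=1000)

# One accuracy score per shuffle-split iteration.
scores = cross_val_score(model, X, y, cv=cv)
print(scores.mean(), scores.std())
```

`ShuffleSplit` does not preserve class proportions in each partition; `StratifiedShuffleSplit` is the scikit-learn variant that does.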