TDSM 7.13

From The Data Science Design Manual Wikia

The complete dataset is divided into three disjoint parts: training, validation, and test data.

Training Data: This data is used to fit the parameters of our model.

Validation Data: This data is used to compare models built with different approaches and to pick the one that performs best. It is also used to tune hyperparameters.

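Test Data: This data is used to generate the final error or performance statistics of our model, such as RMSE, precision, and recall. It should not be used for training or tuning, so that these statistics reflect performance on unseen data.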
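
The three-way division described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name split_dataset, the 60/20/20 proportions, and the fixed random seed are hypothetical choices, not something the exercise prescribes.

```python
import random


def split_dataset(rows, train_frac=0.6, val_frac=0.2, seed=0):
    """Shuffle rows and split them into training, validation, and test lists.

    The fractions and the seed are illustrative assumptions; whatever is left
    after the training and validation portions becomes the test set.
    """
    if train_frac <= 0 or val_frac <= 0 or train_frac + val_frac >= 1:
        raise ValueError("fractions must be positive and sum to less than 1")

    shuffled = list(rows)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)

    n_train = int(len(shuffled) * train_frac)
    n_val = int(len(shuffled) * val_frac)

    train = shuffled[:n_train]                      # used to fit the model
    validation = shuffled[n_train:n_train + n_val]  # used to compare models and tune hyperparameters
    test = shuffled[n_train + n_val:]               # used once, for final statistics
    return train, validation, test


train, validation, test = split_dataset(range(100))
print(len(train), len(validation), len(test))
```

Shuffling before slicing guards against any ordering in the original data leaking into one of the parts. For time-ordered data a chronological split may be more appropriate than a random one.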