TDSM 7.13
From The Data Science Design Manual Wikia
The complete data is divided into training, test and validation data.
Training Data : This data is used to train our model.
Validation Data : This dataset is used to find out which model is working better if various models are built using various approaches. This is used to tune hyper parameters
Test Data: This dataset is used to generate error or performance statistics of our model like RMSE, Precision, recall etc.