Generalization performance relates to prediction capability on independent test data. How do we assess generalization performance and use it to select models?
See Section 7.4 Summary for definitions of test error, expected test error, and training error.
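We estimate (expected) test error for two reasons: model selection, which compares candidate models to choose the best one, and model assessment, which estimates the prediction error of the final chosen model on new data.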
Use a validation set for model selection and the test set only for assessment.
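The split described above can be sketched as follows. This is a minimal illustration, not a prescribed procedure: the 50/25/25 split, the synthetic sine data, the polynomial model family, and the helper names (`three_way_split`, `select_and_assess`) are all assumptions made for the example.

```python
import numpy as np

def three_way_split(n, rng, frac_train=0.5, frac_val=0.25):
    """Return shuffled index arrays for the train, validation, and test parts."""
    idx = rng.permutation(n)
    n_train = int(frac_train * n)
    n_val = int(frac_val * n)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]

def mse(coefs, x, y):
    """Mean squared error of a fitted polynomial on the points (x, y)."""
    return float(np.mean((np.polyval(coefs, x) - y) ** 2))

def select_and_assess(x, y, degrees, rng):
    tr, va, te = three_way_split(len(x), rng)
    # Fit every candidate model on the training part only.
    fits = {d: np.polyfit(x[tr], y[tr], d) for d in degrees}
    # Model selection: compare candidates on the validation part.
    val_err = {d: mse(fits[d], x[va], y[va]) for d in degrees}
    best = min(val_err, key=val_err.get)
    # Model assessment: touch the test part once, for the chosen model only.
    test_err = mse(fits[best], x[te], y[te])
    return best, val_err, test_err

# Hypothetical data: a noisy sine curve.
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 200)
y = np.sin(3 * x) + rng.normal(0, 0.3, 200)

best_degree, val_errors, test_error = select_and_assess(x, y, range(1, 9), rng)
print(best_degree, test_error)
```

The validation error of the chosen degree is a minimum over several candidates, so it tends to be optimistic. That is why the test part is evaluated only once, after the choice has been made.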
When there is too little data to hold out a separate validation set, approximate the selection step analytically (AIC, BIC, MDL) or by efficient sample re-use (cross-validation, bootstrap).
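Both routes can be sketched on a small sample. The code below is an illustration under stated assumptions: it uses polynomial regression on synthetic data, 5-fold cross-validation, and AIC and BIC in their Gaussian form with the noise variance estimated by maximum likelihood, so that -2 log-likelihood equals n log(RSS/n) up to an additive constant. The function names are hypothetical, and the bootstrap and MDL are not shown.

```python
import numpy as np

def kfold_cv_error(x, y, degree, k=5, seed=0):
    """K-fold cross-validation estimate of squared prediction error."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(x)), k)
    sq_err = 0.0
    for i in range(k):
        held_out = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        # Fit on k-1 folds, then score on the fold that was left out.
        coefs = np.polyfit(x[train], y[train], degree)
        sq_err += np.sum((np.polyval(coefs, x[held_out]) - y[held_out]) ** 2)
    return float(sq_err / len(x))

def gaussian_aic_bic(x, y, degree):
    """AIC and BIC, up to a shared additive constant, for a Gaussian model."""
    n = len(x)
    d = degree + 1  # number of polynomial coefficients
    coefs = np.polyfit(x, y, degree)
    rss = np.sum((np.polyval(coefs, x) - y) ** 2)
    neg2_loglik = n * np.log(rss / n)
    return float(neg2_loglik + 2 * d), float(neg2_loglik + d * np.log(n))

# Hypothetical small sample where a three-way split would be wasteful.
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, 60)
y = np.sin(3 * x) + rng.normal(0, 0.3, 60)

degrees = range(1, 9)
cv = {d: kfold_cv_error(x, y, d) for d in degrees}
aic_bic = {d: gaussian_aic_bic(x, y, d) for d in degrees}

best_by_cv = min(cv, key=cv.get)
best_by_aic = min(degrees, key=lambda d: aic_bic[d][0])
best_by_bic = min(degrees, key=lambda d: aic_bic[d][1])
print(best_by_cv, best_by_aic, best_by_bic)
```

The same fold assignment is reused for every degree (fixed `seed`), so differences in cross-validation error reflect the models and not the partition. With n = 60 the BIC penalty per parameter, log(60), exceeds the AIC penalty of 2, so BIC cannot select a higher degree than AIC in this setup.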