17 Mar 2004
CS 3243 - Learning
40
Training and testing sets
nWhere does the test set come from?
1.Collect a large set of examples
2.Divide into training and testing data
3.Train on training data, assess on testing
4.Repeat 1-3 for different splits of the set.
¡
nSame distribution
n “Learning … enable[s] the system to do the task or tasks drawn from the same population” – Herb Simon
¡To think about: Why?
n