13 Nov 2004
WIDM 04: Lee et al. Co-training Web Block Classification
16
XHTML / DIV Evaluation
•Smaller dataset
–1/5 the size, limited sites for sample
–Both annotated and unannotated data sets were smaller
–As a result, fewer co-training iterations
•Single view model still seems to do better
• Single view = all features
• Combined = most confident of l and s learners