CS6210: Document clustering in DLs
Experiments
• Experiment features
*Text only (X)
*Text + Title (T)
*Text + Anchor Words (A)
*Text + Title + Anchor Words (TA)
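
The four feature configurations above can be sketched as a small helper that decides which parts of a page contribute tokens. This is only an illustration, not the setup used in the experiments: the `Page` class, the `build_tokens` function, the whitespace tokenizer, and the choice to simply concatenate tokens without weighting are all assumptions, and the slide does not say how anchor words were collected.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Page:
    """Hypothetical container for one web page (not from the slides)."""
    text: str
    title: str = ""
    # Assumed: anchor text associated with the page, one string per link.
    anchor_words: List[str] = field(default_factory=list)

# Feature-set labels from the slide -> (use title?, use anchor words?)
FEATURE_SETS = {
    "X": (False, False),   # Text only
    "T": (True, False),    # Text + Title
    "A": (False, True),    # Text + Anchor Words
    "TA": (True, True),    # Text + Title + Anchor Words
}

def tokenize(s: str) -> List[str]:
    # Deliberately naive: lowercase and split on whitespace.
    return s.lower().split()

def build_tokens(page: Page, feature_set: str) -> List[str]:
    """Return the token list for a page under one of X, T, A, TA."""
    use_title, use_anchor = FEATURE_SETS[feature_set]
    tokens = tokenize(page.text)
    if use_title:
        tokens += tokenize(page.title)
    if use_anchor:
        for anchor in page.anchor_words:
            tokens += tokenize(anchor)
    return tokens

page = Page(text="Course syllabus and homework",
            title="CS 101",
            anchor_words=["intro course"])
for name in FEATURE_SETS:
    print(name, build_tokens(page, name))
```

The resulting token lists could then be fed to any vectorizer and clustering algorithm; keeping the feature selection in one table makes it easy to run the same pipeline once per configuration.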
• Dataset
*WebKB, containing 4159 web pages from the computer science departments of 4 universities (Cornell, Texas, Washington & Wisconsin).
*7 categories – student, faculty, staff, department, course, project & other.