CS6210: Document clustering in DLs
Evaluation
• Agglomerative hierarchical clustering is superior to k-means.
• Speed is important.
• Fast algorithms are preferred
  - Bisecting k-means
• Suffix tree
  - Linear time complexity
  - Suffix tree built incrementally
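
To make the incremental idea concrete, here is a minimal sketch of phrase-based grouping, assuming a naive word-level suffix trie rather than a true linear-time suffix tree. The names `PhraseTrie`, `add_document`, `base_clusters`, `max_len`, and `min_docs` are hypothetical, and the insertion cost here is bounded by the phrase-length cap, not by a linear-time construction algorithm. Documents are added one at a time, and any phrase shared by at least `min_docs` documents yields a candidate cluster.

```python
class PhraseTrie:
    """Naive word-level suffix trie; each node records which documents contain its phrase."""

    def __init__(self, max_len=4):
        self.max_len = max_len  # cap on phrase length (an assumption to keep the trie small)
        self.root = {"children": {}, "docs": set()}

    def add_document(self, doc_id, text):
        """Insert every suffix of the document, truncated to max_len words."""
        words = text.lower().split()
        for start in range(len(words)):
            node = self.root
            for word in words[start:start + self.max_len]:
                node = node["children"].setdefault(
                    word, {"children": {}, "docs": set()}
                )
                node["docs"].add(doc_id)

    def base_clusters(self, min_docs=2):
        """Return {phrase: set of doc ids} for phrases found in at least min_docs documents."""
        found = {}
        stack = [((), self.root)]
        while stack:
            phrase, node = stack.pop()
            if phrase and len(node["docs"]) >= min_docs:
                found[" ".join(phrase)] = set(node["docs"])
            for word, child in node["children"].items():
                stack.append((phrase + (word,), child))
        return found


# Documents arrive one by one; the trie never has to be rebuilt.
trie = PhraseTrie()
for doc_id, text in enumerate(
    ["cat ate cheese", "mouse ate cheese too", "cat ate mouse too"]
):
    trie.add_document(doc_id, text)
clusters = trie.base_clusters()
```

In this toy run, `clusters` maps each shared phrase (for example "ate cheese") to the set of documents containing it. Merging overlapping candidate clusters is left out of the sketch.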
O. Zamir and O. Etzioni. Web document clustering: A feasibility demonstration. In SIGIR, 1998.
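
Bisecting k-means, listed above as a fast alternative, can be sketched as follows. This is one common formulation, not necessarily the variant evaluated in the lecture: it assumes dense points with squared Euclidean distance (document vectors are often compared by cosine similarity instead), always splits the largest cluster, and keeps the best of a few 2-means trials. The function names and the `trials` and `seed` parameters are hypothetical.

```python
import random


def mean(points):
    """Component-wise mean of a non-empty list of points."""
    dim = len(points[0])
    return [sum(p[i] for p in points) / len(points) for i in range(dim)]


def sq_dist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def sse(points):
    """Sum of squared distances to the cluster mean."""
    center = mean(points)
    return sum(sq_dist(p, center) for p in points)


def two_means(points, rng, iters=20):
    """Split points into two groups with plain 2-means from random seeds."""
    centers = rng.sample(points, 2)
    groups = ([], [])
    for _ in range(iters):
        groups = ([], [])
        for p in points:
            nearer = 0 if sq_dist(p, centers[0]) <= sq_dist(p, centers[1]) else 1
            groups[nearer].append(p)
        if not groups[0] or not groups[1]:
            break  # degenerate split; the caller discards it
        centers = [mean(groups[0]), mean(groups[1])]
    return groups


def bisecting_kmeans(points, k, trials=5, seed=0):
    """Repeatedly bisect the largest cluster until k clusters exist."""
    rng = random.Random(seed)
    clusters = [list(points)]
    while len(clusters) < k:
        clusters.sort(key=len)
        target = clusters.pop()  # largest cluster (selection rule is an assumption)
        if len(target) < 2:
            clusters.append(target)
            break  # nothing left to split
        best = None
        for _ in range(trials):
            a, b = two_means(target, rng)
            if a and b and (
                best is None or sse(a) + sse(b) < sse(best[0]) + sse(best[1])
            ):
                best = (a, b)
        if best is None:  # every trial degenerate, e.g. all points identical
            half = len(target) // 2
            best = (target[:half], target[half:])
        clusters.extend(best)
    return clusters


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10),
          (20, 0), (21, 0), (20, 1)]
result = bisecting_kmeans(points, k=3)
```

Each bisection only runs 2-means on one cluster, which is where the speed advantage over agglomerative methods comes from.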