8
Yee Fan Tan, Min-Yen Kan and Dongwon Lee: Search Engine Driven Author Disambiguation
ACM/IEEE Joint Conference on Digital Libraries 2006
External Resources
•Lay people doing this task with unfamiliar publications may use a search engine, using paper title as query
•Our method tries to approximate this
•For each citation c in C
–Query search engine with title of c as phrase search to obtain a set of relevant URLs
–Represent c by a feature vector of relevant URLs and weighting scheme
•Apply hierarchical agglomerative clustering (HAC) on C to derive k clusters
–Cosine similarity
–Tested with single link, complete link and group average