15
Yee Fan Tan, Min-Yen Kan and Dongwon Lee: Search Engine Driven Author Disambiguation
ACM/IEEE Joint Conference on Digital Libraries 2006
Discussion
•Apparent correlation between accuracy and average number of URLs returned per citation
–Author names with few URLs tend to fare poorly since results are mainly aggregator web sites
•We do not observe any apparent relation between accuracy and number of citations for an author name
–Our algorithm is scalable for large number of citations
•Analysis of returned URLs is very fast, execution time is dominated by search engine querying
–Querying may already be done while spidering, so our algorithm is time-efficient