two_trans
Related Work
l Abundant research on related problems
l DB: approximate join, merge/purge, record linkage
l DL: citation matching, author name disambiguation
l AI: identity uncertainty
l LIS: name authority control
l In a nutshell, existing approaches often do:
l For two entities, e1 and e2, capture their information in data structures,
D(e1) and D(e2)
l Measure the distance or similarity between data structures: dist(D(e1),
D(e2)) = d
l Determine for matching:
o If d < threshold, then e1 and e2 are matching entities
l Work well for common applications
l Ours performs better when
l Entities lack useful information
WIDM 2007