two_trans
Step 1. Select Representative Data
l What to select
l A single token “aho”
l A key phrase “stanford professor”
l A sentence or more?
l How to select
l Assess importance
§ tf, tf*idf, latent topic models, …
l How many to select
l 1, 2, … n
l Where to select from?
l Contents of canonical entity, variant, both
WIDM 2007