24 Aug 2004
CS 5244: Indexing / Classification
46
Distributed Search? Why?
“Surface” Web vs. “Hidden” Web
l“Surface” Web
–Link structure
–Crawlable
–Documents indexed by search engines
l“Hidden” Web
–No link structure
–Documents “hidden” in databases
–Documents not indexed by search engines
–Need to query each collection individually
- From Panos Iperiotis’
VLDB presentation