24 Aug 2004
CS 5244: Indexing / Classification
46
Distributed Search? Why?
“Surface” Web vs. “Hidden” Web
l
“Surface” Web
–
Link structure
–
Crawlable
–
Documents indexed
by search engines
l
“Hidden” Web
–
No link structure
–
Documents “hidden” in databases
–
Documents not indexed by search engines
–
Need to query each collection individually
- From Panos Iperiotis’
VLDB presentation