Inversion – Summary of Techniques
¡ How do these techniques stack up?
¡ Assume a 5 GB corpus and 40 MB main
memory machine
Technique           Memory    Disk   Time
           (MB)   (GB)   (Hours)
*Linked lists (memory)  4000   0   6
Linked lists (disk)     30   4   1100
Sort-based       40   8   20
Lexicon-based     40   0   79
Lexicon w/ disk     40   4   12