Robust Document Understanding
¡
OCR and document understanding are
(currently) fragile technologies
l
Full scan
OCR
store pipeline makes
many assumptions
l
What are some?
¡
________________
¡
________________
¡
________________
¡
________________
¡
________________