¡Rule-based scripts (fragile):
lStill heavily cited and used!
l
¡Wrapper induction: localized extraction
lDefine a local context and features to match and extract
l
¡Text classification: classification
lUse features over the entire document to determine classification.
l