Line Classification – step 1
l Independent classification
SVM
Feature selection:
l Word-specific
• “X. Wang”         single capital :: capital non-dictionary word
l Line-specific
• no. of words, line number, percentage of non-dictionary
words etc.