11 Oct 2005
CS 5244 - Computational Document Analysis
15
In summary…
ˇGenre and authorship analysis relies on highly frequent evidence that is portable across document subjects.
ˇContrast with subject/text classification which looks for specific keywords as evidence.
ˇ
ˇReferences:
ˇMosteller & Wallace (63) Inference in an authorship problem, J American Statistical Association 58(3)
ˇKarlgren & Cutting (94) Recognizing Text Genres with Simple Metrics Using Discriminant Analysis, Proc. of COLING-94.
ˇde Vel, Anderson, Corney & Mohay (01) Mining Email Content for Author Identification Forensics, SIGMOD Record
ˇFoster (00) Author Unknown. Owl Books PE1421 Fos 
ˇBiber (89) A typology of English texts, Linguistics, 27(3)
ˇLee and Myaeng (02) Text genre classification with genre-revealing and subject-revealing features, SIGIR 02
ˇ