N-gram Feature Selection for Authorship Identification
Abstract
Automatic authorship identification offers a valuable tool for supporting crime investigation and security. It can be seen as a multi-class, single-label text categorization task. Automatic authorship identification depends on selecting stylisticfeatures that would capture an authors writing style independent of the content or genre of text. Character n-grams have been used successfully to represent text for stylistic purposes in literature.They seem to be able to capture nuances in lexical, sy...
Σημειώσεις
$aΗ εργασία έχει ψηφιοποιηθεί, αλλά ο συγγραφέας ΔΕΝ έχει ορίσει τα δικαιώματα πρόσβασης.