Exploring word surprisals and authorship verification | James' Coffee Blog
Last night, I was experimenting more with word surprisals (entropy), which are calculated using the probabilities of a word appearing in a specified corpus. For my analyses, I was using a corpus of articles from the New York Times to calculate word surprisals, which has proven effective for my blog [^1]. I started to think about whether you could use word surprisals for authorship verification.