Published in

Proceedings of the 20th ACM international conference on Multimedia - MM '12

DOI: 10.1145/2393347.2396398

Links

Tools

Export citation

Search in Google Scholar

Conversationally-inspired Stylometric Features for Authorship Attribution in Instant Messaging

This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Authorship attribution (AA) aims at recognizing automati-cally the author of a given text sample. Traditionally applied to literary texts, AA faces now the new challenge of recog-nizing the identity of people involved in chat conversations. These share many aspects with spoken conversations, but AA approaches did not take it into account so far. Hence, this paper tries to fill the gap and proposes two novelties that improve the effectiveness of traditional AA approaches for this type of data: the first is to adopt features inspired by Conversation Analysis (in particular for turn-taking), the second is to extract the features from individual turns rather than from entire conversations. The experiments have been performed over a corpus of dyadic chat conversations (77 in-dividuals in total). The performance in identifying the per-sons involved in each exchange, measured in terms of area under the Cumulative Match Characteristic curve, is 89.5%.