A Multi-Dictionary Approach to Abstractness/Concreteness-Based Authorship Attribution

Autor/innen

DOI:

https://doi.org/10.32473/flairs.36.133262

Schlagworte:

authorship attribution, abstractness, concreteness

Abstract

We present some early results from a research project aimed at exploring the usefulness of abstractness/concreteness as stylistic features for authorship attribution. We conjecture that authors use abstract/concrete words and phrases in suf- ficiently unique ways, so that machine learning classifiers can learn to distinguish the individual authors’ writing styles. Our approach is based on using the abstractness rat- ings of words and phrases from texts with established au- thorship to generate training vectors for different machine learning classifiers. The combined word/phrase ratings are extracted from two separate abstractness dictionaries – an approach that yields stronger results than using single ab- stractness dictionaries. The paper describes the details of our methodology and compares the results to those obtained using traditional authorship attribution stylistic features. The limitations of our current methodology and directions for further research are outlined at the end of the paper.

Downloads

Veröffentlicht

2023-05-08

Zitationsvorschlag

Ivanov, L. (2023). A Multi-Dictionary Approach to Abstractness/Concreteness-Based Authorship Attribution. The International FLAIRS Conference Proceedings, 36(1). https://doi.org/10.32473/flairs.36.133262

Ausgabe

Rubrik

Special Track: Applied Natural Language Processing