A Language-independent Metric for Measuring Text Simplification that does not Require a Parallel Corpus

Authors

  • Lucas Mucida, Federal University of Vicosa
  • Alcione Oliveira, Federal University of Vicosa
  • Maurilio Possi, Federal University of Vicosa

DOI:

https://doi.org/10.32473/flairs.v35i.130608

Abstract

Automatic text simplification is one of the many tasks in natural language processing. Judging whether one text is simpler than another requires not only knowledge of the language being analyzed but also cultural knowledge of the target audience. Most current metrics for measuring text simplification rely on human-prepared parallel corpora, which makes them difficult to apply to automatic text simplification in real time. In this paper, we present ISiM (Independent Simplification Metric), a metric that does not require a parallel corpus; it is simple, fast, independent of language and human annotation, and capable of quantifying the simplicity or complexity of a sentence, thereby helping to improve automatic text simplification. The results of the tests performed indicate that the proposed metric has the potential to be used to evaluate automatic simplification methods.

Published

2022-05-04

How to Cite

Mucida, L., Oliveira, A., & Possi, M. (2022). A Language-independent Metric for Measuring Text Simplification that does not Require a Parallel Corpus. The International FLAIRS Conference Proceedings, 35. https://doi.org/10.32473/flairs.v35i.130608

Section

Special Track: Applied Natural Language Processing