A Language-independent Metric for Measuring Text Simplification that does not Require a Parallel Corpus

Authors

  • Lucas Mucida, Federal University of Vicosa
  • Alcione Oliveira, Federal University of Vicosa
  • Maurilio Possi, Federal University of Vicosa

DOI:

https://doi.org/10.32473/flairs.v35i.130608

Abstract

Natural language processing encompasses several tasks, one of which is automatic text simplification. Telling whether one text is simpler than another requires not only knowledge of the language being analyzed but also cultural knowledge of the target audience for which the text is intended. Most current metrics for measuring text simplification rely on human-prepared parallel corpora, which makes them difficult to apply to automatic text simplification in real time. In this paper, we present ISiM (Independent Simplification Metric), a metric that dispenses with a parallel corpus; is simple, fast, and independent of both language and human annotation; and is capable of quantifying the simplicity/complexity of a sentence, thus contributing to the improvement of automatic text simplification. The results of the tests performed indicate that the proposed metric has the potential to be used to evaluate automatic simplification methods.

Published

2022-05-04

Section

Special Track: Applied Natural Language Processing