Multilingual Automatic Term Extraction in Low-Resource Domains

作者

  • NGOC TAN LE Universite du Quebec a Montreal
  • Fatiha Sadat Universite du Quebec a Montreal

##plugins.pubIds.doi.readerDisplayName##:

https://doi.org/10.32473/flairs.v34i1.128502

摘要

With the emergence of the neural networks-based approaches, research on information extraction has benefited from large-scale raw texts by leveraging them using pre-trained embeddings and other data augmentation techniques to deal with challenges and issues in Natural Language Processing tasks. In this paper, we propose an approach using sequence-to-sequence neural networks-based models to deal with term extraction for low-resource domain. Our empirical experiments, evaluating on the multilingual ACTER dataset provided in the LREC-TermEval 2020 shared task on automatic term extraction, proved the efficiency of deep learning approach, in the case of low-data settings, for the automatic term extraction task.

##submission.downloads##

已出版

2021-04-18

栏目

Special Track: Semantic, Logics, Information Extraction and AI