From Tax Compliance in Natural Language to Executable Calculations: Combining Lexical-grammar-based Parsing and Machine Learning

Authors

  • Esme Manandise, Intuit
  • Conrad de Peuter, Intuit
  • Saikat Mukherjee, Intuit

DOI:

https://doi.org/10.32473/flairs.v34i1.128351

Keywords:

tax domain, compliance, hybrid approach, executable calculations, raw texts, unannotated, lexical chart parsing, BERT

Abstract

Regulatory agencies publish tax-compliance content written in natural language intended for human consumption. There has been very little work on automated methods for interpreting this content and for generating executable calculations from it. In this paper, we describe a combination of lexical grammar-based parsing with encoder-decoder architectures for automatically bootstrapping executable calculations from natural language. The combination is particularly suitable for domains such as compliance where training data is scarce and accuracy of interpretation is of high importance. We provide an overview of the implementation for North American income-tax forms.

Published

2021-04-18

Section

Special Track: Applied Natural Language Processing