Enforcing Grammar in Code Synthesis with Transformers

Dmytro Vitel; Stephen Steinle; John Licato

doi:10.32473/flairs.36.133363

Authors

Dmytro Vitel University of South Florida https://orcid.org/0000-0003-4151-6858
Stephen Steinle University of South Florida
John Licato University of South Florida https://orcid.org/0000-0003-4700-9750

DOI:

https://doi.org/10.32473/flairs.36.133363

Keywords:

grammar enforcement, code generation, transformers, GPT, Hearthstone dataset

Abstract

Even more so than natural language, code is extremely sensitive to syntax; a single small error can make an entire snippet invalid. It is therefore important to explore methods for ensuring syntactic correctness in generated code. Existing methods for addressing this issue often rely on syntax-guided decoders with complex architectures. In this work, we present the grammar enforcement method, which introduces a separate layer that constrains the decisions of the transformer during fine-tuning according to syntactic constructs present in both the target language grammar and the given training set. We experiment with the Hearthstone dataset to study the method's effects on the resulting programs and compare it with existing state-of-the-art syntax-guided decoders. We demonstrate a statistically significant positive effect of grammar enforcement on the quality of generated programs in terms of exact match accuracy and the percentage of grammatically correct samples. At the same time, we observe lower values for the text-based metrics chrF and BLEU, potentially indicating their inability to represent the quality of generated abstract syntax sequences.
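
The general idea of constraining a model's token decisions with a grammar can be illustrated with a small sketch. This is not the authors' implementation: the toy grammar `ALLOWED_NEXT`, the vocabulary `VOCAB`, and the helper names below are hypothetical, and the example only shows one common way to apply such a constraint, namely masking the scores of tokens the grammar does not permit before normalizing. In a fine-tuning setting, the masked distribution could be the one passed to the loss.

```python
import math

# Hypothetical toy grammar (an assumption for illustration only):
# for each previous token, the set of tokens permitted to follow it.
ALLOWED_NEXT = {
    "<start>": {"Call", "Name"},
    "Call": {"Name"},
    "Name": {"Call", "<end>"},
}

# Hypothetical vocabulary; the index of each token matches its logit position.
VOCAB = ["Call", "Name", "<end>", "<start>"]


def mask_logits(logits, prev_token):
    """Replace the logits of tokens the grammar disallows with -inf."""
    allowed = ALLOWED_NEXT.get(prev_token, set())
    if not allowed:
        raise ValueError(f"no continuation allowed after {prev_token!r}")
    return [x if tok in allowed else -math.inf for tok, x in zip(VOCAB, logits)]


def softmax(logits):
    """Numerically stable softmax that assigns zero mass to -inf entries."""
    finite_max = max(x for x in logits if x != -math.inf)
    exps = [0.0 if x == -math.inf else math.exp(x - finite_max) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def constrained_choice(logits, prev_token):
    """Pick the most probable token among those the grammar permits."""
    probs = softmax(mask_logits(logits, prev_token))
    best = max(range(len(VOCAB)), key=lambda i: probs[i])
    return VOCAB[best], probs


# Raw scores favour "<end>", but the toy grammar only allows "Name" after "Call".
raw_logits = [0.5, 1.0, 3.0, 0.1]
token, probs = constrained_choice(raw_logits, "Call")
```

In this sketch the unconstrained argmax would select `<end>`, while the masked distribution places all probability on `Name`, the only continuation the toy grammar allows after `Call`.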
