Application of Seq2Seq Models on Code Correction

Huang, Shan and Zhou, Xiao and Chin, Sang (2021) Application of Seq2Seq Models on Code Correction. Frontiers in Artificial Intelligence, 4. ISSN 2624-8212

[thumbnail of pubmed-zip/versions/1/package-entries/frai-04-590215.pdf]

Text
pubmed-zip/versions/1/package-entries/frai-04-590215.pdf - Published Version
Download (2MB)

Official URL: https://doi.org/10.3389/frai.2021.590215

Abstract

We apply various seq2seq models on programming language correction tasks on Juliet Test Suite for C/C++ and Java of Software Assurance Reference Datasets and achieve 75% (for C/C++) and 56% (for Java) repair rates on these tasks. We introduce pyramid encoder in these seq2seq models, which significantly increases the computational efficiency and memory efficiency, while achieving similar repair rate to their nonpyramid counterparts. We successfully carry out error type classification task on ITC benchmark examples (with only 685 code instances) using transfer learning with models pretrained on Juliet Test Suite, pointing out a novel way of processing small programming language datasets.

Item Type:	Article
Subjects:	European Scholar > Multidisciplinary
Depositing User:	Managing Editor
Date Deposited:	06 Mar 2023 06:06
Last Modified:	24 Oct 2024 03:58
URI:	http://article.publish4promo.com/id/eprint/961

Actions (login required)

: View Item