Huang, Shan and Zhou, Xiao and Chin, Sang (2021) Application of Seq2Seq Models on Code Correction. Frontiers in Artificial Intelligence, 4. ISSN 2624-8212
pubmed-zip/versions/1/package-entries/frai-04-590215.pdf - Published Version
Download (2MB)
Abstract
We apply various seq2seq models on programming language correction tasks on Juliet Test Suite for C/C++ and Java of Software Assurance Reference Datasets and achieve 75% (for C/C++) and 56% (for Java) repair rates on these tasks. We introduce pyramid encoder in these seq2seq models, which significantly increases the computational efficiency and memory efficiency, while achieving similar repair rate to their nonpyramid counterparts. We successfully carry out error type classification task on ITC benchmark examples (with only 685 code instances) using transfer learning with models pretrained on Juliet Test Suite, pointing out a novel way of processing small programming language datasets.
Item Type: | Article |
---|---|
Subjects: | European Scholar > Multidisciplinary |
Depositing User: | Managing Editor |
Date Deposited: | 06 Mar 2023 06:06 |
Last Modified: | 24 Oct 2024 03:58 |
URI: | http://article.publish4promo.com/id/eprint/961 |