A Neural Network Fusion Model for Source Code Comments Generation

doi:10.3969/j.issn.1674-1579.2021.02.006

Abstract

Comments improve the readability of source code and help developers understand what software does, so they play a key role in software maintenance and evolution. Existing work shows, however, that missing comments are common in real-world projects. Current research on automatic source code comment generation has two limitations: first, it does not exploit lexical information in depth; second, it does not fuse lexical and syntactic information well. This paper therefore proposes a neural network fusion model for source code comment generation built on the encoder-decoder framework. The model uses the encoder-decoder network to learn a deep representation of the lexical information in source code, combines it with syntactic information mined from the abstract syntax tree, and applies a fusion mechanism to form a more comprehensive encoding vector of functional semantics, from which comments are generated. Experiments on a public dataset show that the method outperforms the compared models on evaluation metrics such as BLEU4 and METEOR, which supports its effectiveness.

Key words: source code comments, abstract syntax tree, encoder-decoder, fusion model
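Since the abstract reports results in BLEU4, a small illustration of what that metric measures may help. The following is a minimal sketch, assuming unsmoothed sentence-level BLEU-4 against a single reference comment; the abstract does not say which BLEU variant (corpus-level or sentence-level, smoothed or not) the authors used, and the function names `ngrams` and `bleu4` are hypothetical.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return a Counter of all n-grams (as tuples) in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidate, reference):
    """Unsmoothed sentence-level BLEU-4 against a single reference.

    Assumption: this is an illustrative variant, not necessarily the
    exact scoring setup used in the paper.
    """
    if not candidate:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, 5):
        cand_counts = ngrams(candidate, n)
        ref_counts = ngrams(reference, n)
        total = sum(cand_counts.values())
        # Clip each candidate n-gram count by its count in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in cand_counts.items())
        if total == 0 or matched == 0:
            return 0.0  # no smoothing: any zero precision gives BLEU = 0
        log_precision_sum += math.log(matched / total)
    # Brevity penalty punishes candidates shorter than the reference.
    if len(candidate) >= len(reference):
        bp = 1.0
    else:
        bp = math.exp(1.0 - len(reference) / len(candidate))
    # Geometric mean of the four n-gram precisions, scaled by the penalty.
    return bp * math.exp(log_precision_sum / 4.0)


generated = "returns the sum of two integers".split()
human = "returns the sum of two numbers".split()
score = bleu4(generated, human)
```

Because this variant applies no smoothing, any generated comment shorter than four tokens scores zero, which is one reason published evaluations often use a smoothed or corpus-level form instead.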

CLC number: TP311

Cite this article: ZHOU Qilin, WANG Xu, LIU Xudong. A Neural Network Fusion Model for Source Code Comments Generation[J]. Aerospace Control and Application, 2021, 47(2): 42-48.

