Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-23351
Publication type: Article in scientific journal
Type of review: Peer review (publication)
Title: How to identify class comment types? : a multi-language approach for class comment classification
Authors: Rani, Pooja
Panichella, Sebastiano
Leuenberger, Manuel
Di Sorbo, Andrea
Nierstrasz, Oscar
et. al: No
DOI: 10.1016/j.jss.2021.111047
10.21256/zhaw-23351
Published in: Journal of Systems and Software
Volume(Issue): 181
Issue: 111047
Issue Date: 19-Jul-2021
Publisher / Ed. Institution: Elsevier
ISSN: 0164-1212
1873-1228
Language: English
Subjects: Natural language processing technique; Code comment analysis; Software documentation
Subject (DDC): 005: Computer programming, programs and data
006: Special computer methods
Abstract: Most software maintenance and evolution tasks require developers to understand the source code of their software systems. Software developers usually inspect class comments to gain knowledge about program behavior, regardless of the programming language they are using. Unfortunately, (i) different programming languages present language-specific code commenting notations/guidelines; and (ii) the source code of software projects often lacks comments that adequately describe the class behavior, which complicates program comprehension and evolution activities. To handle these challenges, this paper investigates the different language-specific class commenting practices of three programming languages: Python, Java, and Smalltalk. In particular, we systematically analyze the similarities and differences of the information types found in class comments of projects developed in these languages. We propose an approach that leverages two techniques, namely Natural Language Processing and Text Analysis, to automatically identify various types of information from class comments i.e., the specific types of semantic information found in class comments. To the best of our knowledge, no previous work has provided a comprehensive taxonomy of class comment types for these three programming languages with the help of a common automated approach. Our results confirm that our approach can classify frequent class comment information types with high accuracy for Python, Java, and Smalltalk programming languages. We believe this work can help to monitor and assess the quality and evolution of code comments in different program languages, and thus support maintenance and evolution tasks.
URI: https://digitalcollection.zhaw.ch/handle/11475/23351
Fulltext version: Published version
License (according to publishing contract): CC BY-NC-ND 4.0: Attribution - Non commercial - No derivatives 4.0 International
Departement: School of Engineering
Organisational Unit: Institute of Computer Science (InIT)
Published as part of the ZHAW project: COSMOS – DevOps for Complex Cyber-physical Systems of Systems
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2021_Rani-etal_Identify-class-comment-types.pdf2.17 MBAdobe PDFThumbnail
View/Open
Show full item record
Rani, P., Panichella, S., Leuenberger, M., Di Sorbo, A., & Nierstrasz, O. (2021). How to identify class comment types? : a multi-language approach for class comment classification. Journal of Systems and Software, 181(111047). https://doi.org/10.1016/j.jss.2021.111047
Rani, P. et al. (2021) ‘How to identify class comment types? : a multi-language approach for class comment classification’, Journal of Systems and Software, 181(111047). Available at: https://doi.org/10.1016/j.jss.2021.111047.
P. Rani, S. Panichella, M. Leuenberger, A. Di Sorbo, and O. Nierstrasz, “How to identify class comment types? : a multi-language approach for class comment classification,” Journal of Systems and Software, vol. 181, no. 111047, Jul. 2021, doi: 10.1016/j.jss.2021.111047.
RANI, Pooja, Sebastiano PANICHELLA, Manuel LEUENBERGER, Andrea DI SORBO und Oscar NIERSTRASZ, 2021. How to identify class comment types? : a multi-language approach for class comment classification. Journal of Systems and Software. 19 Juli 2021. Bd. 181, Nr. 111047. DOI 10.1016/j.jss.2021.111047
Rani, Pooja, Sebastiano Panichella, Manuel Leuenberger, Andrea Di Sorbo, and Oscar Nierstrasz. 2021. “How to Identify Class Comment Types? : A Multi-Language Approach for Class Comment Classification.” Journal of Systems and Software 181 (111047). https://doi.org/10.1016/j.jss.2021.111047.
Rani, Pooja, et al. “How to Identify Class Comment Types? : A Multi-Language Approach for Class Comment Classification.” Journal of Systems and Software, vol. 181, no. 111047, July 2021, https://doi.org/10.1016/j.jss.2021.111047.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.