Please use this identifier to cite or link to this item: https://doi.org/10.21256/zhaw-22748
Publication type: Conference paper
Type of review: Peer review (abstract)
Title: Makar : a framework for multi-source studies based on unstructured data
Authors: Birrer, Mathias
Rani, Pooja
Panichella, Sebastiano
Nierstrasz, Oscar
et. al: No
DOI: 10.1109/SANER50967.2021.00069
10.21256/zhaw-22748
Proceedings: Proceedings of the 2021 IEEE SANER Conference
Page(s): 577
Pages to: 581
Conference details: 28th IEEE International Conference on Software Analysis, Evolution and Reengineering, Honolulu, USA, 9-12 March 2021
Issue Date: 2021
Publisher / Ed. Institution: IEEE
ISBN: 978-1-7281-9630-5
Language: English
Subject (DDC): 005: Computer programming, programs and data
Abstract: To perform various development and maintenance tasks, developers frequently seek information on various sources such as mailing lists, Stack Overflow (SO), and Quora. Researchers analyze these sources to understand developer information needs in these tasks. However, extracting and preprocessing unstructured data from various sources, building and maintaining a reusable dataset is often a time-consuming and iterative process. Additionally, the lack of tools for automating this data analysis process complicates the task to reproduce previous results or datasets.To address these concerns we propose Makar, which provides various data extraction and preprocessing methods to support researchers in conducting reproducible multi-source studies. To evaluate Makar, we conduct a case study that analyzes code comment related discussions from SO, Quora, and mailing lists. Our results show that Makar is helpful for preparing reproducible datasets from multiple sources with little effort, and for identifying the relevant data to answer specific research questions in a shorter time compared to state-of-the-art tools, which is of critical importance for studies based on unstructured data. Tool webpage: https://github.com/maethub/makar
Further description: © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
URI: https://digitalcollection.zhaw.ch/handle/11475/22748
Fulltext version: Accepted version
License (according to publishing contract): Licence according to publishing contract
Departement: School of Engineering
Organisational Unit: Institute of Computer Science (InIT)
Appears in collections:Publikationen School of Engineering

Files in This Item:
File Description SizeFormat 
2021_Birrer-etal_Makar-a-framwork-for-multi-source-studies.pdfAccepted Version494.8 kBAdobe PDFThumbnail
View/Open
Show full item record
Birrer, M., Rani, P., Panichella, S., & Nierstrasz, O. (2021). Makar : a framework for multi-source studies based on unstructured data [Conference paper]. Proceedings of the 2021 IEEE SANER Conference, 577–581. https://doi.org/10.1109/SANER50967.2021.00069
Birrer, M. et al. (2021) ‘Makar : a framework for multi-source studies based on unstructured data’, in Proceedings of the 2021 IEEE SANER Conference. IEEE, pp. 577–581. Available at: https://doi.org/10.1109/SANER50967.2021.00069.
M. Birrer, P. Rani, S. Panichella, and O. Nierstrasz, “Makar : a framework for multi-source studies based on unstructured data,” in Proceedings of the 2021 IEEE SANER Conference, 2021, pp. 577–581. doi: 10.1109/SANER50967.2021.00069.
BIRRER, Mathias, Pooja RANI, Sebastiano PANICHELLA und Oscar NIERSTRASZ, 2021. Makar : a framework for multi-source studies based on unstructured data. In: Proceedings of the 2021 IEEE SANER Conference. Conference paper. IEEE. 2021. S. 577–581. ISBN 978-1-7281-9630-5
Birrer, Mathias, Pooja Rani, Sebastiano Panichella, and Oscar Nierstrasz. 2021. “Makar : A Framework for Multi-Source Studies Based on Unstructured Data.” Conference paper. In Proceedings of the 2021 IEEE SANER Conference, 577–81. IEEE. https://doi.org/10.1109/SANER50967.2021.00069.
Birrer, Mathias, et al. “Makar : A Framework for Multi-Source Studies Based on Unstructured Data.” Proceedings of the 2021 IEEE SANER Conference, IEEE, 2021, pp. 577–81, https://doi.org/10.1109/SANER50967.2021.00069.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.