IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Automatically Labelled Software Topic Model

Automatically Labelled Software Topic Model
View Sample PDF
Author(s): Youcef Bouziane (Université Oran1, Oran, Algeria), Mustapha Kamel Abdi (Université Oran 1, Oran, Algeria)and Salah Sadou (IRISA, Université Bretagne Sud, Vannes, France)
Copyright: 2020
Volume: 11
Issue: 1
Pages: 22
Source title: International Journal of Open Source Software and Processes (IJOSSP)
Editor(s)-in-Chief: Marta Catillo (Università degli Studi del Sannio, Italy)
DOI: 10.4018/IJOSSP.2020010104

Purchase

View Automatically Labelled Software Topic Model on the publisher's website for pricing and purchasing information.

Abstract

Public software repositories (SR) maintain a massive amount of valuable data offering opportunities to support software engineering (SE) tasks. Researchers have applied information retrieval techniques in mining software repositories. Topic models are one of these techniques. However, this technique does not give an interpretation nor labels to the extracted topics and it requires manual analysis to identify them. Some approaches were proposed to automatically label the topics using tags in SR, but they do not consider the existence of spam-tags and they have difficulties to scale to large tag space. This article introduces a novel approach called automatically labelled software topic model (AL-STM) that labels the topics based on observed tags in SR. It mitigates the shortcomings of manual and automatic labelling of topics in SE. AL-STM is implemented using 22K GitHub projects and evaluated in a SE task (tag recommending) against the currently used techniques. The empirical results suggest that AL-STM is more robust in terms of MAP and nDCG, and more scalable to large tag space.

Related Content

Roland Robert Schreiber. © 2023. 20 pages.
Ekbal Rashid, Mohan Prakash. © 2022. 16 pages.
Rasmita Panighrahi, Sanjay Kumar Kuanar, Lov Kumar. © 2022. 31 pages.
Sushil Kumar, SK Muttoo, V. B. Singh. © 2022. 16 pages.
Omprakash Tembhurne, Sonali Milmile, Ganesh R. Pathak, Atul O. Thakare, Abhijeet Thakare. © 2022. 17 pages.
Zouaoui Louhab, Fatma Boufera. © 2022. 15 pages.
Madonna Fanoos, Abeer Hamdy, Khaled A. Nagaty. © 2022. 19 pages.
Body Bottom