From Descriptive to Predictive: Forecasting Emerging Research Areas in Software Traceability Using NLP from Systematic Studies

Zaki Pauzi, Andrea Capiluppi

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)

Abstract

Systematic literature reviews (SLRs) and systematic mapping studies (SMSs) are common studies in any discipline to describe and classify past works, and to inform a research field of potential new areas of investigation. This last task is typically achieved by observing gaps in past works, and hinting at the possibility of future research in those gaps. Using an NLP-driven methodology, this paper proposes a meta-analysis to extend current systematic methodologies of literature reviews and mapping studies. Our work leverages a Word2Vec model, pre-trained in the software engineering domain, and is combined with a time series analysis. Our aim is to forecast future trajectories of research outlined in systematic studies, rather than just describing them. Using the same dataset from our own previous mapping study, we were able to go beyond descriptively analysing the data that we gathered, or to barely 'guess' future directions. In this paper, we show how recent advancements in the field of our SMS, and the use of time series, enabled us to forecast future trends in the same field. Our proposed methodology sets a precedent for exploring the potential of language models coupled with time series in the context of systematically reviewing the literature.

Original languageEnglish
Title of host publicationProceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2023
EditorsHermann Kaindl, Hermann Kaindl, Hermann Kaindl, Mike Mannion, Leszek Maciaszek, Leszek Maciaszek
PublisherScience and Technology Publications, Lda
Pages538-545
Number of pages8
ISBN (Electronic)9789897586477
DOIs
Publication statusPublished - 2023
Event18th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2023 - Prague, Czech Republic
Duration: 24-Apr-202325-Apr-2023

Publication series

NameInternational Conference on Evaluation of Novel Approaches to Software Engineering, ENASE - Proceedings
Volume2023-April
ISSN (Electronic)2184-4895

Conference

Conference18th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2023
Country/TerritoryCzech Republic
CityPrague
Period24/04/202325/04/2023

Keywords

  • Natural Language Processing
  • Software Traceability
  • Systematic Review
  • Time Series

Fingerprint

Dive into the research topics of 'From Descriptive to Predictive: Forecasting Emerging Research Areas in Software Traceability Using NLP from Systematic Studies'. Together they form a unique fingerprint.

Cite this