TY - GEN
T1 - From Descriptive to Predictive
T2 - 18th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2023
AU - Pauzi, Zaki
AU - Capiluppi, Andrea
N1 - Publisher Copyright:
Copyright © 2023 by SCITEPRESS - Science and Technology Publications, Lda. Under CC license (CC BY-NC-ND 4.0)
PY - 2023
Y1 - 2023
N2 - Systematic literature reviews (SLRs) and systematic mapping studies (SMSs) are common studies in any discipline to describe and classify past works, and to inform a research field of potential new areas of investigation. This last task is typically achieved by observing gaps in past works, and hinting at the possibility of future research in those gaps. Using an NLP-driven methodology, this paper proposes a meta-analysis to extend current systematic methodologies of literature reviews and mapping studies. Our work leverages a Word2Vec model, pre-trained in the software engineering domain, and is combined with a time series analysis. Our aim is to forecast future trajectories of research outlined in systematic studies, rather than just describing them. Using the same dataset from our own previous mapping study, we were able to go beyond descriptively analysing the data that we gathered, or to barely 'guess' future directions. In this paper, we show how recent advancements in the field of our SMS, and the use of time series, enabled us to forecast future trends in the same field. Our proposed methodology sets a precedent for exploring the potential of language models coupled with time series in the context of systematically reviewing the literature.
AB - Systematic literature reviews (SLRs) and systematic mapping studies (SMSs) are common studies in any discipline to describe and classify past works, and to inform a research field of potential new areas of investigation. This last task is typically achieved by observing gaps in past works, and hinting at the possibility of future research in those gaps. Using an NLP-driven methodology, this paper proposes a meta-analysis to extend current systematic methodologies of literature reviews and mapping studies. Our work leverages a Word2Vec model, pre-trained in the software engineering domain, and is combined with a time series analysis. Our aim is to forecast future trajectories of research outlined in systematic studies, rather than just describing them. Using the same dataset from our own previous mapping study, we were able to go beyond descriptively analysing the data that we gathered, or to barely 'guess' future directions. In this paper, we show how recent advancements in the field of our SMS, and the use of time series, enabled us to forecast future trends in the same field. Our proposed methodology sets a precedent for exploring the potential of language models coupled with time series in the context of systematically reviewing the literature.
KW - Natural Language Processing
KW - Software Traceability
KW - Systematic Review
KW - Time Series
UR - http://www.scopus.com/inward/record.url?scp=85160566966&partnerID=8YFLogxK
U2 - 10.5220/0011964100003464
DO - 10.5220/0011964100003464
M3 - Conference contribution
AN - SCOPUS:85160566966
T3 - International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE - Proceedings
SP - 538
EP - 545
BT - Proceedings of the 18th International Conference on Evaluation of Novel Approaches to Software Engineering, ENASE 2023
A2 - Kaindl, Hermann
A2 - Kaindl, Hermann
A2 - Kaindl, Hermann
A2 - Mannion, Mike
A2 - Maciaszek, Leszek
A2 - Maciaszek, Leszek
PB - Science and Technology Publications, Lda
Y2 - 24 April 2023 through 25 April 2023
ER -