An Architectural Technical Debt Index Based on Machine Learning and Architectural Smells

Research output: Contribution to journalArticleAcademicpeer-review

15 Citations (Scopus)
183 Downloads (Pure)

Abstract

A key aspect of technical debt (TD) management is the ability to measure the amount of principal accumulated in a system. The current literature contains an array of approaches to estimate TD principal, however, only a few of them focus specifically on architectural TD, but none of them satisfies all three of the following criteria: being fully automated, freely available, and thoroughly validated. Moreover, a recent study has shown that many of the current approaches suffer from certain shortcomings, such as relying on hand-picked thresholds. In this article, we propose a novel approach to estimate architectural technical debt principal based on machine learning and architectural smells to address such shortcomings. Our approach can estimate the amount of technical debt principal generated by a single architectural smell instance. To do so, we adopt novel techniques from Information Retrieval to train a learning-to-rank machine learning model (more specifically, a gradient boosting machine) that estimates the severity of an architectural smell and ensure the transparency of the predictions. Then, for each instance, we statically analyse the source code to calculate the exact number of lines of code creating the smell. Finally, we combine these two values to calculate the technical debt principal. To validate the approach, we conducted a case study and interviewed 16 practitioners, from both open source and industry, and asked them about their opinions on the TD principal estimations for several smells detected in their projects. The results show that for 71% of instances, practitioners agreed that the estimations provided were representative of the effort necessary to refactor the smell.

Original languageEnglish
Pages (from-to)4169-4195
Number of pages27
JournalIEEE Transactions on Software Engineering
Volume49
Issue number8
Early online date14-Jun-2023
DOIs
Publication statusPublished - 1-Aug-2023

Keywords

  • arcan
  • architectural smells
  • case study
  • learning-to-rank
  • Machine learning
  • technical debt

Fingerprint

Dive into the research topics of 'An Architectural Technical Debt Index Based on Machine Learning and Architectural Smells'. Together they form a unique fingerprint.

Cite this