Pattern Recognition and Context Prediction of COVID-19 cases in European Countries

Arzu Tosayeva*, Ermiyas Birihanu, Tsegaye Misikir Tashu

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Academic › peer-review

    Abstract

    The COVID-19 pandemic has had a significant global impact, and understanding its trends and patterns requires data analysis. This is challenging because transmission dynamics are complex and many factors influence the virus's spread. The data associated with COVID-19 are extensive and constantly evolving, which makes extracting meaningful insights difficult. The objective of this study is therefore to analyze the impact of COVID-19 in various European countries, to identify common patterns, and to make predictions within the relevant context. To accomplish this, we used clustering techniques to reveal patterns in COVID-19 cases among European countries. The implementation involved cluster analysis to estimate labels based on cluster size and density while considering relevant background information. Subsequently, a classification model was applied to the labeled dataset. Using the K-Prototypes algorithm and the Silhouette score, we determined the optimal number of clusters. These clusters were then combined based on density, and the degree of sparsity was assessed. As a result, two clusters emerged: one labeled "low chance of infection" and the other "high chance of infection." Using these results, we implemented a classification algorithm that achieved an accuracy of 90%. For this study, we gathered data from five different sources and consolidated them into a single dataset. Our findings demonstrate that combining COVID-19 datasets with diverse features enables trend analysis, and that clustering algorithms support successful label identification in unsupervised learning scenarios involving unlabeled data. The density and size of clusters prove valuable in estimating labels, enhancing our overall understanding of the data. Our code is publicly available here.
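
The cluster-count selection described in the abstract can be illustrated with a minimal, standard-library sketch. It is not the authors' implementation: the toy rows, the feature meanings, the `gamma` weight, the seeding rule (first k rows), and the function names `mixed_distance`, `k_prototypes`, and `silhouette` are all assumptions made for illustration. The sketch uses the usual K-Prototypes dissimilarity (squared Euclidean distance on numeric features plus a weighted count of categorical mismatches) and picks the k with the highest mean Silhouette score computed over that same dissimilarity.

```python
from collections import Counter


def mixed_distance(a, b, gamma=1.0):
    """K-Prototypes-style dissimilarity: squared Euclidean distance on the
    numeric part plus gamma times the number of categorical mismatches."""
    (num_a, cat_a), (num_b, cat_b) = a, b
    numeric = sum((x - y) ** 2 for x, y in zip(num_a, num_b))
    mismatches = sum(x != y for x, y in zip(cat_a, cat_b))
    return numeric + gamma * mismatches


def k_prototypes(rows, k, gamma=1.0, max_iter=20):
    """Tiny k-prototypes loop. Assumption: the first k rows seed the prototypes."""
    prototypes = list(rows[:k])
    labels = None
    for _ in range(max_iter):
        # Assignment step: each row joins its nearest prototype.
        new_labels = [
            min(range(k), key=lambda j: mixed_distance(r, prototypes[j], gamma))
            for r in rows
        ]
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
        # Update step: mean for numeric features, mode for categorical ones.
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if not members:
                continue  # keep the old prototype for an empty cluster
            means = tuple(sum(col) / len(members)
                          for col in zip(*(m[0] for m in members)))
            modes = tuple(Counter(col).most_common(1)[0][0]
                          for col in zip(*(m[1] for m in members)))
            prototypes[j] = (means, modes)
    return labels


def silhouette(rows, labels, gamma=1.0):
    """Mean silhouette over all rows, using the mixed dissimilarity."""
    scores = []
    for i, r in enumerate(rows):
        by_cluster = {}
        for j, other in enumerate(rows):
            if i != j:
                by_cluster.setdefault(labels[j], []).append(
                    mixed_distance(r, other, gamma))
        own = by_cluster.pop(labels[i], None)
        if not own or not by_cluster:
            scores.append(0.0)  # convention: singleton clusters score 0
            continue
        a = sum(own) / len(own)                                 # cohesion
        b = min(sum(d) / len(d) for d in by_cluster.values())   # separation
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)


# Made-up toy rows (NOT data from the study): two scaled numeric features
# and one categorical feature per row.
rows = [
    ((0.10, 0.20), ("relaxed",)),
    ((0.90, 0.80), ("strict",)),
    ((0.15, 0.25), ("relaxed",)),
    ((0.85, 0.90), ("strict",)),
    ((0.05, 0.15), ("relaxed",)),
    ((0.95, 0.85), ("strict",)),
]

# Score each candidate k and keep the one with the highest mean silhouette.
scores = {k: silhouette(rows, k_prototypes(rows, k)) for k in (2, 3, 4)}
best_k = max(scores, key=scores.get)
print(best_k, scores)
```

On these toy rows the two-cluster split scores highest. With real data, a library implementation with better initialization would likely be preferable to this hand-rolled loop, and `gamma` would need tuning to balance the numeric and categorical parts.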

    Original language: English
    Title of host publication: 23rd Conference Information Technologies - Applications and Theory, ITAT 2023
    Publisher: CEUR Workshop Proceedings
    Pages: 50-57
    Number of pages: 8
    Publication status: Published - 2023
    Event: 23rd Conference Information Technologies - Applications and Theory, ITAT 2023 - Tatranske Matliare, Slovakia
    Duration: 22-Sept-2023 → 26-Sept-2023

    Publication series

    Name: CEUR Workshop Proceedings
    Volume: 3498
    ISSN (Print): 1613-0073

    Conference

    Conference: 23rd Conference Information Technologies - Applications and Theory, ITAT 2023
    Country/Territory: Slovakia
    City: Tatranske Matliare
    Period: 22/09/2023 → 26/09/2023

    Keywords

    • Context prediction
    • COVID-19
    • Label estimation
    • Pattern recognition
