TY - JOUR
T1 - Static and adaptive subspace information fusion for indefinite heterogeneous proximity data
AU - Münch, Maximilian
AU - Röder, Manuel
AU - Heilig, Simon
AU - Raab, Christoph
AU - Schleif, Frank Michael
N1 - Funding Information:
MM and MR thank the Bavarian HighTech agenda and the Würzburg Center for Artificial Intelligence and Robotics (CAIRO). SH was supported by Max-Weber Scholarship of the State Government of Bavaria. Additionally, we thank Benjamin Paassen (Bielefeld University) and Sascha Saralajew (NEC Laboratories Europe) for hours of highly beneficial discussions.
Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/10/28
Y1 - 2023/10/28
N2 - Heterogeneous data is common in many real-world machine learning applications, such as healthcare, market analysis, environmental sciences, and social media analysis. In these domains, data is often represented in different modalities and, most of the time, in non-vectorial formats, like text, images, and video. Traditional machine learning algorithms are often limited in their ability to effectively analyze and learn from such diverse data types. In this paper, we propose two approaches for such heterogeneous data analysis: static and adaptive subspace kernel fusion. The first approach is a kernel-based method extracting the essential parts of the subspace of each input modality and creating one single fused representation of the data. The second approach utilizes an adaptation step by integrating the weighting of spectral properties into the fusion process in order to improve the data's representation with respect to a given classification task. Our proposed methods are evaluated on several multi-modal, heterogeneous data sets and demonstrate significant performance improvement compared to other methods in the field. Our results highlight the importance of fusing the underlying subspace information of heterogeneous data for achieving superior performance in machine learning tasks.
AB - Heterogeneous data is common in many real-world machine learning applications, such as healthcare, market analysis, environmental sciences, and social media analysis. In these domains, data is often represented in different modalities and, most of the time, in non-vectorial formats, like text, images, and video. Traditional machine learning algorithms are often limited in their ability to effectively analyze and learn from such diverse data types. In this paper, we propose two approaches for such heterogeneous data analysis: static and adaptive subspace kernel fusion. The first approach is a kernel-based method extracting the essential parts of the subspace of each input modality and creating one single fused representation of the data. The second approach utilizes an adaptation step by integrating the weighting of spectral properties into the fusion process in order to improve the data's representation with respect to a given classification task. Our proposed methods are evaluated on several multi-modal, heterogeneous data sets and demonstrate significant performance improvement compared to other methods in the field. Our results highlight the importance of fusing the underlying subspace information of heterogeneous data for achieving superior performance in machine learning tasks.
KW - Heterogeneous data analysis
KW - Indefinite learning
KW - Kernel fusion
KW - Multi-modal data
KW - Multiple kernel learning
KW - Proximity learning
UR - http://www.scopus.com/inward/record.url?scp=85167827666&partnerID=8YFLogxK
U2 - 10.1016/j.neucom.2023.126635
DO - 10.1016/j.neucom.2023.126635
M3 - Article
AN - SCOPUS:85167827666
SN - 0925-2312
VL - 555
JO - Neurocomputing
JF - Neurocomputing
M1 - 126635
ER -