Samenvatting

The interpretation of high throughput sequencing data is limited by our incomplete functional understanding of coding and non-coding transcripts. Reliably predicting the function of such transcripts can overcome this limitation. Here we report the use of a consensus independent component analysis and guilt-by-association approach to predict over 23,000 functional groups comprised of over 55,000 coding and non-coding transcripts using publicly available transcriptomic profiles. We show that, compared to using Principal Component Analysis, Independent Component Analysis-derived transcriptional components enable more confident functionality predictions, improve predictions when new members are added to the gene sets, and are less affected by gene multi-functionality. Predictions generated using human or mouse transcriptomic data are made available for exploration in a publicly available web portal. Our understanding of the function of many transcripts is still incomplete, limiting the interpretability of transcriptomic data. Here the authors use consensus-independent component analysis, together with a guilt-by-association approach, to improve the prediction of gene function.

Originele taal-2English
Artikelnummer1464
Aantal pagina's14
TijdschriftNature Communications
Volume12
Nummer van het tijdschrift1
Vroegere onlinedatum5-mrt-2021
DOI's
StatusPublished - 5-mrt-2021

Citeer dit