Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification

Abstract Background Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume...

Full description

Saved in:
Bibliographic Details
Main Authors: Yin Guo (Author), Huiran Li (Author), Menglan Cai (Author), Limin Li (Author)
Format: Book
Published: BMC, 2019-12-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_fd2df23f73784c45b42f7ce9e9d13bbc
042 |a dc 
100 1 0 |a Yin Guo  |e author 
700 1 0 |a Huiran Li  |e author 
700 1 0 |a Menglan Cai  |e author 
700 1 0 |a Limin Li  |e author 
245 0 0 |a Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification 
260 |b BMC,   |c 2019-12-01T00:00:00Z. 
500 |a 10.1186/s12920-019-0633-1 
500 |a 1755-8794 
520 |a Abstract Background Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume strong consistency among different sources of datasets, and thus may lose efficacy when the consistency is relatively weak. Furthermore, they could not identify the conflicting parts for each view, which might be important in applications such as cancer subtype identification. Methods In this work, we propose an integrative subspace clustering method (ISC) by common and specific decomposition to identify clustering structures with multi-omics datasets. The main idea of our ISC method is that the original representations for the samples in each view could be reconstructed by the concatenation of a common part and a view-specific part in orthogonal subspaces. The problem can be formulated as a matrix decomposition problem and solved efficiently by our proposed algorithm. Results The experiments on simulation and text datasets show that our method outperforms other state-of-art methods. Our method is further evaluated by identifying cancer types using a colorectal dataset. We finally apply our method to cancer subtype identification for five cancers using TCGA datasets, and the survival analysis shows that the subtypes we found are significantly better than other compared methods. Conclusion We conclude that our ISC model could not only discover the weak common information across views but also identify the view-specific information. 
546 |a EN 
690 |a Subtype identification 
690 |a Multi-view clustering 
690 |a Subspace clustering 
690 |a Internal medicine 
690 |a RC31-1245 
690 |a Genetics 
690 |a QH426-470 
655 7 |a article  |2 local 
786 0 |n BMC Medical Genomics, Vol 12, Iss S9, Pp 1-17 (2019) 
787 0 |n https://doi.org/10.1186/s12920-019-0633-1 
787 0 |n https://doaj.org/toc/1755-8794 
856 4 1 |u https://doaj.org/article/fd2df23f73784c45b42f7ce9e9d13bbc  |z Connect to this object online.