Similarity network fusion for aggregating data types on a genomic scale

Bo Wang; Aziz M Mezlini; Feyyaz Demir; Marc Fiume; Zhuowen Tu; Michael Brudno; Benjamin Haibe-Kains; Anna Goldenberg

doi:10.1038/nmeth.2810

Similarity network fusion for aggregating data types on a genomic scale

Nat Methods. 2014 Mar;11(3):333-7. doi: 10.1038/nmeth.2810. Epub 2014 Jan 26.

Authors

Bo Wang¹, Aziz M Mezlini², Feyyaz Demir², Marc Fiume³, Zhuowen Tu⁴, Michael Brudno², Benjamin Haibe-Kains⁵, Anna Goldenberg²

Affiliations

¹ 1] Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada. [2].
² 1] Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada. [2] Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
³ Department of Computer Science, University of Toronto, Toronto, Ontario, Canada.
⁴ Department of Cognitive Science, University of California San Diego, San Diego, California, USA.
⁵ 1] Institut de Recherches Cliniques de Montréal, Université de Montréal, Montréal, Quebec, Canada. [2].

PMID: 24464287
DOI: 10.1038/nmeth.2810

Abstract

Recent technologies have made it cost-effective to collect diverse types of genome-wide data. Computational methods are needed to combine these data to create a comprehensive view of a given disease or a biological process. Similarity network fusion (SNF) solves this problem by constructing networks of samples (e.g., patients) for each available data type and then efficiently fusing these into one network that represents the full spectrum of underlying data. For example, to create a comprehensive view of a disease given a cohort of patients, SNF computes and fuses patient similarity networks obtained from each of their data types separately, taking advantage of the complementarity in the data. We used SNF to combine mRNA expression, DNA methylation and microRNA (miRNA) expression data for five cancer data sets. SNF substantially outperforms single data type analysis and established integrative approaches when identifying cancer subtypes and is effective for predicting survival.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Brain Neoplasms / genetics
Computational Biology / methods*
Disease / genetics
Gene Regulatory Networks*
Genomics*
Glioblastoma / genetics
Humans
Statistics as Topic / methods*