Abstract
Background: The new paradigm of precision medicine brought an increasing interest in survival prediction based on the integration of multi-omics and multi-sources data. Several models have been developed to address this task, but their performances are widely variable depending on the specific disease and are often poor on noisy datasets, such as in the case of non-small cell lung cancer (NSCLC).
Objective: The aim of this work is to introduce a novel computational approach, named multi-omic twolayer SVM (mtSVM), and to exploit it to get a survival-based risk stratification of NSCLC patients from an ongoing observational prospective cohort clinical study named PROMOLE.
Methods: The model implements a model-based integration by means of a two-layer feed-forward network of FastSurvivalSVMs, and it can be used to get individual survival estimates or survival-based risk stratification. Despite being designed for NSCLC, its range of applicability can potentially cover the full spectrum of survival analysis problems where integration of different data sources is needed, independently of the pathology considered.
Results: The model is here applied to the case of NSCLC, and compared with other state-of-the-art methods, proving excellent performance. Notably, the model, trained on data from The Cancer Genome Atlas (TCGA), has been validated on an independent cohort (from the PROMOLE study), and the results were consistent. Gene-set enrichment analysis of the risk groups, as well as exome analysis, revealed well-defined molecular profiles, such as a prognostic mutational gene signature with potential implications in clinical practice.
Graphical Abstract
[http://dx.doi.org/10.1016/j.ejca.2020.12.023] [PMID: 33578357]
[http://dx.doi.org/10.3238/arztebl.m2021.0025] [PMID: 33536117]
[http://dx.doi.org/10.1145/3214306]
[http://dx.doi.org/10.1214/12-AOAS597] [PMID: 23745156]
[http://dx.doi.org/10.1186/s13059-020-02015-1] [PMID: 32393329]
[http://dx.doi.org/10.1201/9781315137438]
[http://dx.doi.org/10.3389/fgene.2019.00166] [PMID: 30906311]
[http://dx.doi.org/10.1038/s41598-017-17031-8] [PMID: 29209073]
[http://dx.doi.org/10.1371/journal.pcbi.1006076] [PMID: 29634719]
[http://dx.doi.org/10.1038/nature12213] [PMID: 23770567]
[http://dx.doi.org/10.1093/bioinformatics/btaa462] [PMID: 32657401]
[http://dx.doi.org/10.1186/s12859-021-04279-1] [PMID: 34229612]
[http://dx.doi.org/10.1002/(SICI)1097-0258(19960229)15:4<361::AID-SIM168>3.0.CO;2-4] [PMID: 8668867]
[http://dx.doi.org/10.1016/j.jtho.2015.09.009] [PMID: 26762738]
[http://dx.doi.org/10.1016/j.cllc.2022.05.007] [PMID: 35697558]
[http://dx.doi.org/10.1073/pnas.0506580102] [PMID: 16199517]
[http://dx.doi.org/10.14806/ej.28.0.1035]
[http://dx.doi.org/10.1093/nar/gkz369] [PMID: 31066453]
[http://dx.doi.org/10.1038/75556] [PMID: 10802651]
[http://dx.doi.org/10.2174/1566524015666150505161412] [PMID: 25941815]
[http://dx.doi.org/10.1634/theoncologist.2019-0846] [PMID: 32396674]
[http://dx.doi.org/10.1038/onc.2015.185] [PMID: 26050618]
[http://dx.doi.org/10.2147/DDDT.S363665] [PMID: 35685767]
[http://dx.doi.org/10.1093/carcin/bgv026] [PMID: 25863124]
[http://dx.doi.org/10.1186/s13073-020-00760-3] [PMID: 32669118]
[http://dx.doi.org/10.1002/jso.21520] [PMID: 20213688]
[http://dx.doi.org/10.1038/s41598-017-09951-2] [PMID: 28855549]
[http://dx.doi.org/10.1038/s41598-018-32796-2] [PMID: 30258109]
[http://dx.doi.org/10.1371/journal.pone.0237889] [PMID: 32817625]
[http://dx.doi.org/10.1038/s41429-021-00437-y] [PMID: 34282314]
[http://dx.doi.org/10.1186/s12943-021-01316-8] [PMID: 33546704]
[http://dx.doi.org/10.1101/gad.314617.118] [PMID: 30275043]