TY - JOUR
T1 - ScalpelSig Designs Targeted Genomic Panels from Data to Detect Activity of Mutational Signatures
AU - Franzese, Nicholas
AU - Fan, Jason
AU - Sharan, Roded
AU - Leiserson, Mark D.M.
N1 - Publisher Copyright:
© Copyright 2022, Mary Ann Liebert, Inc., publishers 2022.
PY - 2022/1
Y1 - 2022/1
N2 - Over the past decade, a promising line of cancer research has utilized machine learning to mine statistical patterns of mutations in cancer genomes for information. Recent work shows that these statistical patterns, commonly referred to as "mutational signatures,"have diverse therapeutic potential as biomarkers for cancer therapies. However, translating this potential into reality is hindered by limited access to sequencing in the clinic. Almost all methods for mutational signature analysis (MSA) rely on whole genome or whole exome sequencing data, while sequencing in the clinic is typically limited to small gene panels. To improve clinical access to MSA, we considered the question of whether targeted panels could be designed for the purpose of mutational signature detection. Here we present ScalpelSig, to our knowledge the first algorithm that automatically designs genomic panels optimized for detection of a given mutational signature. The algorithm learns from data to identify genome regions that are particularly indicative of signature activity. Using a cohort of breast cancer genomes as training data, we show that ScalpelSig panels substantially improve accuracy of signature detection compared to baselines. We find that some ScalpelSig panels even approach the performance of whole exome sequencing, which observes over 10 × as much genomic material. We test our algorithm under a variety of conditions, showing that its performance generalizes to another dataset of breast cancers, to smaller panel sizes, and to lesser amounts of training data.
AB - Over the past decade, a promising line of cancer research has utilized machine learning to mine statistical patterns of mutations in cancer genomes for information. Recent work shows that these statistical patterns, commonly referred to as "mutational signatures,"have diverse therapeutic potential as biomarkers for cancer therapies. However, translating this potential into reality is hindered by limited access to sequencing in the clinic. Almost all methods for mutational signature analysis (MSA) rely on whole genome or whole exome sequencing data, while sequencing in the clinic is typically limited to small gene panels. To improve clinical access to MSA, we considered the question of whether targeted panels could be designed for the purpose of mutational signature detection. Here we present ScalpelSig, to our knowledge the first algorithm that automatically designs genomic panels optimized for detection of a given mutational signature. The algorithm learns from data to identify genome regions that are particularly indicative of signature activity. Using a cohort of breast cancer genomes as training data, we show that ScalpelSig panels substantially improve accuracy of signature detection compared to baselines. We find that some ScalpelSig panels even approach the performance of whole exome sequencing, which observes over 10 × as much genomic material. We test our algorithm under a variety of conditions, showing that its performance generalizes to another dataset of breast cancers, to smaller panel sizes, and to lesser amounts of training data.
KW - cancer genomics
KW - combinatorial optimization
KW - mutational signatures
UR - http://www.scopus.com/inward/record.url?scp=85123298957&partnerID=8YFLogxK
U2 - 10.1089/cmb.2021.0453
DO - 10.1089/cmb.2021.0453
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 34986026
AN - SCOPUS:85123298957
SN - 1066-5277
VL - 29
SP - 56
EP - 73
JO - Journal of Computational Biology
JF - Journal of Computational Biology
IS - 1
ER -