TY - JOUR
T1 - Deviation from baseline mutation burden provides powerful and robust rare-variants association test for complex diseases
AU - Jiang, Lin
AU - Jiang, Hui
AU - Dai, Sheng
AU - Chen, Ying
AU - Song, Youqiang
AU - Tang, Clara Sze Man
AU - Pang, Shirley Yin Yu
AU - Ho, Shu Leong
AU - Wang, Binbin
AU - Garcia-Barcelo, Maria Mercedes
AU - Tam, Paul Kwong Hang
AU - Cherny, Stacey S.
AU - Li, Mulin Jun
AU - Sham, Pak Chung
AU - Li, Miaoxin
N1 - Publisher Copyright:
© 2022 The Author(s).
PY - 2022/4/8
Y1 - 2022/4/8
N2 - Identifying rare variants that contribute to complex diseases is challenging because of the low statistical power in current tests comparing cases with controls. Here, we propose a novel and powerful rare variants association test based on the deviation of the observed mutation burden of a gene in cases from a baseline predicted by a weighted recursive truncated negative-binomial regression (RUNNER) on genomic features available from public data. Simulation studies show that RUNNER is substantially more powerful than state-of-the-art rare variant association tests and has reasonable type 1 error rates even for stratified populations or in small samples. Applied to real case-control data, RUNNER recapitulates known genes of Hirschsprung disease and Alzheimer's disease missed by current methods and detects promising new candidate genes for both disorders. In a case-only study, RUNNER successfully detected a known causal gene of amyotrophic lateral sclerosis. The present study provides a powerful and robust method to identify susceptibility genes with rare risk variants for complex diseases.
AB - Identifying rare variants that contribute to complex diseases is challenging because of the low statistical power in current tests comparing cases with controls. Here, we propose a novel and powerful rare variants association test based on the deviation of the observed mutation burden of a gene in cases from a baseline predicted by a weighted recursive truncated negative-binomial regression (RUNNER) on genomic features available from public data. Simulation studies show that RUNNER is substantially more powerful than state-of-the-art rare variant association tests and has reasonable type 1 error rates even for stratified populations or in small samples. Applied to real case-control data, RUNNER recapitulates known genes of Hirschsprung disease and Alzheimer's disease missed by current methods and detects promising new candidate genes for both disorders. In a case-only study, RUNNER successfully detected a known causal gene of amyotrophic lateral sclerosis. The present study provides a powerful and robust method to identify susceptibility genes with rare risk variants for complex diseases.
UR - http://www.scopus.com/inward/record.url?scp=85128247162&partnerID=8YFLogxK
U2 - 10.1093/nar/gkab1234
DO - 10.1093/nar/gkab1234
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 34931221
AN - SCOPUS:85128247162
SN - 0305-1048
VL - 50
SP - E34
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 6
ER -