A Fast Multiple-Kernel Method With Applications to Detect Gene-Environment Interaction

Rachel Marceau, Wenbin Lu, Shannon Holloway, Michèle M. Sale, Bradford B. Worrall, Stephen R. Williams, Fang Chi Hsu, Jung Ying Tzeng

Research output: Contribution to journal › Article › peer-review

7 Citations (Scopus)

Abstract

Kernel machine (KM) models are a powerful tool for exploring associations between sets of genetic variants and complex traits. Although most KM methods use a single kernel function to assess the marginal effect of a variable set, KM analyses involving multiple kernels have become increasingly popular. Multikernel analysis allows researchers to study more complex problems, such as assessing gene-gene or gene-environment interactions, incorporating variance-component-based methods for population substructure into rare-variant association testing, and assessing the conditional effects of a variable set while adjusting for other variable sets. The KM framework is robust and powerful and provides efficient dimension reduction for multifactor analyses, but it requires the estimation of high-dimensional nuisance parameters. Traditional estimation techniques, including regularization and the expectation-maximization (EM) algorithm, have a large computational cost and do not scale to the large sample sizes needed for rare-variant analysis. Therefore, in the context of gene-environment interaction, we propose a computationally efficient and statistically rigorous "fastKM" algorithm for multikernel analysis that is based on a low-rank approximation to the nuisance-effect kernel matrices. Our algorithm is applicable to various trait types (e.g., continuous, binary, and survival traits) and can be implemented using any existing single-kernel analysis software. Through extensive simulation studies, we show that, for quantitative traits, our algorithm performs similarly to an EM-based KM approach while running much faster. We also apply our method to the Vitamin Intervention for Stroke Prevention (VISP) clinical trial, examining gene-by-vitamin effects on recurrent stroke risk and gene-by-age effects on change in homocysteine level.
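
The low-rank idea mentioned in the abstract can be illustrated with a generic truncated eigendecomposition of a kernel matrix. The sketch below is not the authors' fastKM implementation: the linear kernel, the simulated genotype matrix `G`, the 99% eigenvalue-mass threshold, and the function names `linear_kernel` and `low_rank_features` are all assumptions made for illustration.

```python
import numpy as np


def linear_kernel(X):
    """Linear kernel K = X X^T / p for an n-by-p matrix X (illustrative choice)."""
    return X @ X.T / X.shape[1]


def low_rank_features(K, var_explained=0.99):
    """Return an n-by-r matrix Z such that Z @ Z.T approximates the PSD kernel K.

    r is the smallest number of leading eigenvalues whose sum reaches
    `var_explained` of the total (a hypothetical truncation rule).
    """
    K = (K + K.T) / 2                       # enforce symmetry numerically
    eigvals, eigvecs = np.linalg.eigh(K)    # eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]       # reorder to descending
    eigvals = np.clip(eigvals[order], 0.0, None)  # drop tiny negative round-off
    eigvecs = eigvecs[:, order]
    cum = np.cumsum(eigvals) / eigvals.sum()
    r = min(int(np.searchsorted(cum, var_explained)) + 1, len(eigvals))
    return eigvecs[:, :r] * np.sqrt(eigvals[:r])


# Simulated data: 200 subjects, 10 variants coded 0/1/2 (purely hypothetical).
rng = np.random.default_rng(0)
n, p = 200, 10
G = rng.integers(0, 3, size=(n, p)).astype(float)
G -= G.mean(axis=0)                         # center each variant

K = linear_kernel(G)                        # n-by-n nuisance kernel
Z = low_rank_features(K)                    # n-by-r surrogate features, r <= p
rel_err = np.linalg.norm(K - Z @ Z.T) / np.linalg.norm(K)
print(Z.shape, rel_err)
```

In a sketch like this, the n-by-n kernel is replaced by a small number of columns in `Z`, which could plausibly be treated as ordinary covariates so that only one kernel remains for a single-kernel tool to handle. Whether this matches the paper's actual procedure should be checked against the article itself.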

Original language: English
Pages (from-to): 456-468
Number of pages: 13
Journal: Genetic Epidemiology
Volume: 39
Issue number: 6
DOIs
Publication status: Published - 2015 Sep 1

All Science Journal Classification (ASJC) codes

  • Epidemiology
  • Genetics (clinical)
