A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank
R. Tibshirani | T. Hastie | M. Rivas | Junyang Qian | Yosuke Tanigawa | Matthew Aguirre | Chris Chang | Wenfei Du
[1] Andrey Ziyatdinov,et al. Efficient analysis of large-scale genome-wide data with two R packages: bigstatsr and bigsnpr , 2018, Bioinform..
[2] P. Visscher,et al. Nature Genetics Advance Online Publication , 2022 .
[3] Trevor Hastie,et al. Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.
[4] P. Visscher,et al. 10 Years of GWAS Discovery: Biology, Function, and Translation. , 2017, American journal of human genetics.
[5] Marcelo P. Segura-Lepe,et al. Rare and low-frequency coding variants alter human adult height , 2016, Nature.
[6] M. Rivas,et al. Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study , 2018, Nature Communications.
[7] D. Reich,et al. Population Structure and Eigenanalysis , 2006, PLoS genetics.
[8] Lin Xiao,et al. Dual Averaging Methods for Regularized Stochastic Learning and Online Optimization , 2009, J. Mach. Learn. Res..
[9] H. Wold. Soft Modelling by Latent Variables: The Non-Linear Iterative Partial Least Squares (NIPALS) Approach , 1975, Journal of Applied Probability.
[10] Yaohui Zeng,et al. The biglasso Package: A Memory- and Computation-Efficient Solver for Lasso Model Fitting with Big Data in R , 2017, R J..
[11] C. Robert. Discussion of "Sure independence screening for ultra-high dimensional feature space" by Fan and Lv. , 2008 .
[12] Jennifer R. Harris,et al. Heritability of adult body height: a comparative study of twin cohorts in eight countries. , 2003, Twin research : the official journal of the International Society for Twin Studies.
[13] N. Patterson,et al. Using Extended Genealogy to Estimate Components of Heritability for 23 Quantitative and Dichotomous Traits , 2013, PLoS genetics.
[14] P. Visscher,et al. Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs. , 2013, American journal of human genetics.
[15] Trevor Hastie,et al. Fast Lasso method for Large-scale and Ultrahigh-dimensional Cox Model with applications to UK Biobank , 2020, bioRxiv.
[16] Stephen Weston,et al. Scalable Strategies for Computing with Massive Data , 2013 .
[17] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.
[18] Naomi R. Wray,et al. Improved polygenic prediction by Bayesian multiple regression on summary statistics , 2019, Nature Communications.
[19] Henrik,et al. Association analyses of 249,796 individuals reveal eighteen new loci associated with body mass index , 2012 .
[20] Ayellet V. Segrè,et al. Hundreds of variants clustered in genomic loci and biological pathways affect human height , 2010, Nature.
[21] P. Visscher,et al. GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.
[22] P. Visscher,et al. Bias, precision and heritability of self-reported and clinically measured height in Australian twins , 2006, Human Genetics.
[23] Jianqing Fan,et al. Sure independence screening for ultrahigh dimensional feature space , 2006, math/0612857.
[24] Luke R. Lloyd-Jones,et al. Signatures of negative selection in the genetic architecture of human complex traits , 2018, Nature Genetics.
[25] H. D. Patterson,et al. Recovery of inter-block information when block sizes are unequal , 1971 .
[26] Manuel A. R. Ferreira,et al. Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings , 2006, PLoS genetics.
[27] Jie Wang,et al. Lasso screening rules via dual polytope projection , 2012, J. Mach. Learn. Res..
[28] P. Visscher,et al. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index , 2015, Nature Genetics.
[29] Tian Ge,et al. Polygenic Prediction via Bayesian Regression and Continuous Shrinkage Priors , 2018 .
[30] Martin J. Wainwright,et al. Dual Averaging for Distributed Optimization: Convergence Analysis and Network Scaling , 2010, IEEE Transactions on Automatic Control.
[31] D. Reich,et al. Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.
[32] Louis Lello,et al. Accurate Genomic Prediction of Human Height , 2017, Genetics.
[33] P. Visscher,et al. From Galton to GWAS: quantitative genetics of human height. , 2010, Genetics research.
[34] Laurent El Ghaoui,et al. Safe Feature Elimination for the LASSO and Sparse Supervised Learning Problems , 2010, 1009.4219.
[35] W. G. Hill,et al. Heritability in the genomics era — concepts and misconceptions , 2008, Nature Reviews Genetics.
[36] Carson C Chow,et al. Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.
[37] P. Visscher,et al. Common SNPs explain a large proportion of heritability for human height , 2011 .
[38] Scott Shenker,et al. Spark: Cluster Computing with Working Sets , 2010, HotCloud.
[39] Ross M. Fraser,et al. Genetic studies of body mass index yield new insights for obesity biology , 2015, Nature.
[40] P. Visscher,et al. Estimating missing heritability for disease from genome-wide association studies. , 2011, American journal of human genetics.
[41] P. Donnelly,et al. The UK Biobank resource with deep phenotyping and genomic data , 2018, Nature.
[42] Trevor Hastie,et al. Computer Age Statistical Inference: Algorithms, Evidence, and Data Science , 2016 .
[43] R. Tibshirani,et al. Strong rules for discarding predictors in lasso‐type problems , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.
[44] Christopher M. DeBoever,et al. Components of genetic associations across 2,138 phenotypes in the UK Biobank highlight novel adipocyte biology , 2018, bioRxiv.
[45] R. Tibshirani. Regression Shrinkage and Selection via the Lasso , 1996 .
[46] P. Bickel,et al. SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR , 2008, 0801.1095.
[47] Ross M. Fraser,et al. Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.
[48] Robert Tibshirani,et al. The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.
[49] Christopher M. DeBoever,et al. Components of genetic associations across 2,138 phenotypes in the UK Biobank highlight adipocyte biology , 2019, Nature Communications.
[50] Jian Huang,et al. COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION. , 2011, The annals of applied statistics.
[51] Stephen D. Turner,et al. qqman: an R package for visualizing GWAS results using Q-Q and manhattan plots , 2014, bioRxiv.
[52] H. Zou,et al. Regularization and variable selection via the elastic net , 2005 .
[53] Trevor J. Hastie,et al. Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..
[54] Nicolai Meinshausen,et al. Relaxed Lasso , 2007, Comput. Stat. Data Anal..
[55] Peng Zhao,et al. On Model Selection Consistency of Lasso , 2006, J. Mach. Learn. Res..
[56] Stephen P. Boyd,et al. Proximal Algorithms , 2013, Found. Trends Optim..
[57] E. DeLong,et al. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. , 1988, Biometrics.
[58] D. Cox. Regression Models and Life-Tables , 1972 .
[59] Eric R. Ziegel,et al. The Elements of Statistical Learning , 2003, Technometrics.
[60] Mehryar Mohri,et al. Confidence Intervals for the Area Under the ROC Curve , 2004, NIPS.
[61] D.,et al. Regression Models and Life-Tables , 2022 .
[62] Peter Z. G. Qian,et al. Fast Penalized Regression and Cross Validation for Tall Data with the oem Package , 2018, J. Stat. Softw..