Systematic assessment of COVID-19 host genetics using whole genome sequencing data
by Axel Schmidt, Nicolas Casadei, Fabian Brand, German Demidov, Elaheh Vojgani, Ayda Abolhassani, Rana Aldisi, Guillaume Butler-Laporte, DeCOI host genetics group , T. Madhusankha Alawathurage, Max Augustin, Robert Bals, Carla Bellinghausen, Marc Moritz Berger, Michael Bitzer, Christian Bode, Jannik Boos, Thorsten Brenner, Oliver A. Cornely, Thomas Eggermann, Johanna Erber, Torsten Feldt, Christian Fuchsberger, Julien Gagneur, Siri Göpel, Tobias Haack, Helene Häberle, Frank Hanses, Julia Heggemann, Ute Hehr, Johannes C. Hellmuth, Christian Herr, Anke Hinney, Per Hoffmann, Thomas Illig, Björn-Erik Ole Jensen, Verena Keitel, Sarah Kim-Hellmuth, Philipp Koehler, Ingo Kurth, Anna-Lisa Lanz, Eicke Latz, Clara Lehmann, Tom Luedde, Carlo Maj, Michael Mian, Abigail Miller, Maximilian Muenchhoff, Isabell Pink, Ulrike Protzer, Hana Rohn, Jan Rybniker, Federica Scaggiante, Anna Schaffeldt, Clemens Scherer, Maximilian Schieck, Susanne V. Schmidt, Philipp Schommers, Christoph D. Spinner, Maria J. G. T. Vehreschild, Thirumalaisamy P. Velavan, Sonja Volland, Sibylle Wilfling, Christof Winter, J. Brent Richards, DeCOI , André Heimbach, Kerstin Becker, Stephan Ossowski, Joachim L. Schultze, Peter Nürnberg, Markus M. Nöthen, Susanne Motameny, Michael Nothnagel, Olaf Riess, Eva C. Schulte, Kerstin U. Ludwig
Courses of SARS-CoV-2 infections are highly variable, ranging from asymptomatic to lethal COVID-19. Though research has shown that host genetic factors contribute to this variability, cohort-based joint analyses of variants from the entire allelic spectrum in individuals with confirmed SARS-CoV-2 infections are still lacking. Here, we present the results of whole genome sequencing in 1,220 mainly vaccine-naïve individuals with confirmed SARS-CoV-2 infection, including 827 hospitalized COVID-19 cases. We observed the presence of autosomal-recessive or likely compound heterozygous monogenic disorders in six individuals, all of which were hospitalized and significantly younger than the rest of the cohort. We did not observe any suggestive causal variants in or around the established risk gene TLR7. Burden testing in the largest population subgroup (i.e., Europeans) suggested nominal enrichments of rare variants in coding and non-coding regions of interferon immune response genes in the overall analysis and male subgroup. Case-control analyses of more common variants confirmed associations with previously reported risk loci, with the key locus at 3p21 reaching genome-wide significance. Polygenic scores accurately captured risk in an age-dependent manner. By enabling joint analyses of different types of variation across the entire frequency spectrum, this data will continue to contribute to the elucidation of COVID-19 etiology.