Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson’s disease

Hop, Paul J.; Lai, Dongbing; Keagle, Pamela J.; Baron, Desiree M.; Kenna, Brendan J.; Kooyman, Maarten; Shankaracharya; Halter, Cheryl; Straniero, Letizia; Asselta, Rosanna; Bonvegna, Salvatore; Soto-Beasley, Alexandra I.; Wszolek, Zbigniew K.; Uitti, Ryan J.; Isaias, Ioannis Ugo; Pezzoli, Gianni; Ticozzi, Nicola; Ross, Owen A.; Veldink, Jan H.; Foroud, Tatiana M.; Kenna, Kevin P.; Landers, John E.

doi:10.1038/s41588-024-01787-7

Download PDF

Letter
Open access
Published: 10 June 2024

Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson’s disease

Nature Genetics volume 56, pages 1371–1376 (2024)Cite this article

12k Accesses
21 Citations
46 Altmetric
Metrics details

Subjects

Abstract

Despite substantial progress, causal variants are identified only for a minority of familial Parkinson’s disease (PD) cases, leaving high-risk pathogenic variants unidentified^1,2. To identify such variants, we uniformly processed exome sequencing data of 2,184 index familial PD cases and 69,775 controls. Exome-wide analyses converged on RAB32 as a novel PD gene identifying c.213C > G/p.S71R as a high-risk variant presenting in ~0.7% of familial PD cases while observed in only 0.004% of controls (odds ratio of 65.5). This variant was confirmed in all cases via Sanger sequencing and segregated with PD in three families. RAB32 encodes a small GTPase known to interact with LRRK2 (refs. ^3,4). Functional analyses showed that RAB32 S71R increases LRRK2 kinase activity, as indicated by increased autophosphorylation of LRRK2 S1292. Here our results implicate mutant RAB32 in a key pathological mechanism in PD—LRRK2 kinase activity^5,6,7—and thus provide novel insights into the mechanistic connections between RAB family biology, LRRK2 and PD risk.

Exome sequencing in Asian populations identifies low-frequency and rare coding variation influencing Parkinson’s disease risk

Article Open access 21 November 2024

Analysis of rare Parkinson’s disease variants in millions of people

Article Open access 08 January 2024

Loss-of-function variants in ITSN1 confer high risk of Parkinson’s disease

Article Open access 15 August 2024

Main

Approximately 10–15% of patients with Parkinson’s disease (PD) are classified as ‘familial cases’, a designation conventionally restricted to patients known to have a first degree relative also affected by the disorder^8,9. The identification of causal rare variants in familial PD has contributed enormously to our current understanding of the disease, ultimately yielding drug targets, biomarkers and key insights into disease mechanisms^1,10,11,12. Currently, seven genes have been classified as definitely associated with familial PD^2,13. The discovery of these familial PD genes has been achieved through various family-based study designs, including linkage analysis, homozygosity mapping and segregation filtering^1,10,12. However, in many cases, conventional family-based study designs are insufficient to identify causal variants due to a combination of genetic heterogeneity across families, reduced penetrance and limited sample size within families. Rare variant association testing methods, such as gene burden analysis, provide an alternative strategy for discovering rare genetic risk factors¹⁴. Instead of leveraging familial structure, these methods leverage case–control differences in cumulative rare variant frequencies. This approach has recently been used to study rare genetic variation in idiopathic PD¹⁵, but thus far, targeted analyses of familial PD cases have not been performed. In previous work, we demonstrated substantial power gains for disease gene discovery in amyotrophic lateral sclerosis by restricting rare variant association testing to familial cases and controls^16,17,18. The rationale of selecting familial cases is to increase sensitivity by enriching for patients carrying rare variants of large effect. In this Letter, we apply this strategy to familial PD, conducting to our knowledge the largest genetic analysis of familial PD so far.

We combined sequencing data from 16 cohorts, including both whole-exome and whole-genome sequencing (WGS) data, comprising a total of 2,824 individuals with familial PD and 78,683 controls. We uniformly processed all sequence data, including alignment to the Genome Reference Consortium human build 38 (GRCh38) reference genome and joint variant calling¹⁹. We retained individuals of predominantly European ancestry who met quality control criteria (2,756 cases and 73,879 controls) and then retained unrelated individuals, resulting in an association cohort consisting of 2,184 index familial PD cases (one affected individual per family) and 69,775 controls (Extended Data Figs. 1–3). Within this cohort, we identified 5,044,315 variants that passed strict quality control, out of which 2,114,963 were predicted to be nonsynonymous (Extended Data Fig. 4).

To identify genes associated with familial PD, we performed gene-based analyses for both low-frequency (minor allele frequency (MAF) <0.05) and ultrarare (≤5 carriers) nonsynonymous variants using an omnibus test that combines burden, sequencing kernel association test (SKAT) and the aggregated Cauchy association variant test (ACAT-V) (Methods)^20,21,22. No significant disease associations were observed within the ultrarare variant category (Extended Data Fig. 5a,b); however, analyses of low-frequency variants (MAF <0.05) revealed disease associations for LRRK2, GBA and RAB32 at exome-wide significance (P < 2.7 × 10⁻⁶; Fig. 1a and Supplementary Data 1). Concurrently, we conducted exome-wide single-variant analyses, identifying four exome-wide significant variants in LRRK2, GBA and RAB32 (Fig. 1b, Extended Data Fig. 5c,d and Supplementary Data 2). Among the four significant variants, three had been previously implicated in PD, including c.1093G > A/p.E365K in GBA; c.6055G > A/p.G2019S and c.4321C > T/p.R1441C in LRRK2 (test statistics for all known PD variants are provided in Supplementary Data 2). The fourth variant, a c.213C > G/p.S71R variant of RAB32 (odds ratio of 65.5, P = 7.8 × 10⁻¹⁶), represented a novel discovery. We observed no evidence of genomic inflation in any of the analyses performed (gene λ_1,000 = 0.94, single variant λ_1,000 = 0.95; Extended Data Fig. 5). Of note, two additional genes (SNAPC1 and C5) were initially flagged as potential exome-wide significant associations but were then confirmed as technical artifacts in subsequent validation analyses (Methods, Supplementary Tables 1 and 2, Extended Data Fig. 5e,f, and Supplementary Figs. 1 and 2).

Fig. 1: Rare nonsynonymous gene-based and single-variant analyses in 2,184 PD cases and 69,775 controls identify *RAB32*, *LRRK2* and *GBA.*

Both the gene-based and single-variant analyses converged on RAB32 as a novel PD gene. Conditional analysis revealed that the observed RAB32 gene association was due to the p.S71R variant (P_conditional = 0.83; Fig. 1b–d), whereas conditional analyses for LRRK2 and GBA confirmed that known residual associations from additional disease relevant rare variants were detectable within the dataset (Fig. 1c; GBA, P_conditional = 3.1 × 10⁻⁶ and LRRK2, P_conditional = 8.7 × 10⁻³). Among 2,184 index PD cases, we identified one homozygous carrier and 15 heterozygous carriers of the RAB32 p.S71R variant (MAF_case of 0.0037), while only three (heterozygous) variant carriers were identified among the 69,775 controls (MAF_ctrl of 0.000022). The MAF observed in the control group is similar to that reported in 44,329 non-neurological European ancestry individuals in the gnomAD (v2.1.1) reference database (MAF_gnomAD of 0.000023), confirming that the variant is very rare in the general population²³. For three PD cases carrying the p.S71R variant, exome sequencing data were available for one of their affected siblings, and the variant segregated with the disease in all three families (Fig. 2 and Extended Data Fig. 6). Thus, in total, we identified 18 PD cases carrying the p.S71R variant, of which 17 were heterozygous carriers and one was a homozygous carrier (no evidence for inbreeding, F = −0.004). Sanger sequencing confirmed the presence of the variant in all 18 PD cases (Supplementary Table 1 and Supplementary Fig. 1). Haplotype analysis indicated that carriers shared a ~300-kb haplotype surrounding the p.S71R variant (Extended Data Fig. 7). Evaluation of other known PD variants revealed the co-occurrence of p.S71R in two carriers with GBA risk variants (c.1093G > A/p.E365K and c.1223C > T/p.T408M) (Extended Data Table 1).

**Fig. 2: Pedigrees of families with multiple sequenced individuals.**

We identified an additional four carriers among 3,380 PD cases (0.12%) in the publicly available Accelerating Medicines Partnership program for PD WGS data (https://www.amp-pd.org/), of which two were familial PD cases (out of 900; 0.22%), one was a sporadic PD case and family history was not available for the fourth case.

Carriers of the RAB32 p.S71R variant had a mean age of onset of 56 years (s.d. 13; range, 39–82 years), which did not significantly differ from noncarriers (Extended Data Fig. 8 and Extended Data Table 1). The variant was observed in 11 males and 7 females, reflecting the overall sex ratio of patients in this study (~60% male). All p.S71R carriers evaluated for L-dopa response were confirmed as L-dopa responsive (15 evaluated and 3 not evaluated) and for the majority (10 out of 18), dyskinesia and/or motor fluctuations were reported. For all but one of the carriers, asymmetry was reported (16 out of 17), and the majority presented with rest tremors (11 out of 18). Moreover, postural instability, freezing of gait and/or falls were common symptoms among carriers reported in 11 out of 18 patients. Detailed case reports were available for nine patients (Supplementary Note). Among them, seven out of nine patients reported nonmotor symptoms, including, among others, hyposmia, autonomic dysfunction, sleep problems and depression. Notably, two patients had a medical history of Hashimoto’s thyroiditis. The case report of the homozygous carrier shows that the clinical presentation did not notably differ from that of heterozygote carriers. He had late-onset (61 years), L-dopa-responsive PD characterized by a right resting tremor and micrographia with motor fluctuations and dyskinesia emerging 9 years after onset. The patient passed away 12 years after the onset, with falls and mild cognitive impairment appearing 10 and 11 years after onset, respectively.

RAB32 belongs to the RAB family of small GTPases, which is composed of 70 members that play key roles in regulating intracellular vesicular transport²⁴. Various members of the RAB family are known to be either substrates or regulators of the key PD protein LRRK2 (refs. ^24,25,26). Most pathogenic LRRK2 variants increase kinase activity, in turn resulting in increased phosphorylation of several RAB family members^25,27,28. Interestingly, while there is no evidence supporting phosphorylation of RAB32 by LRRK2, RAB32 has been shown to interact with LRRK2 through the N-terminal Armadillo domain of LRRK2, and may regulate its subcellular localization^3,4. Moreover, overexpression of RAB29, a member of the RAB32 subfamily, has been shown to activate LRRK2, recruiting it to the trans-Golgi network and increasing its kinase activity²⁶.

Given the established interaction between LRRK2 and RAB32, along with the closely related RAB29 functioning as a LRRK2 activator, we hypothesized that RAB32 S71R results in increased LRRK2 activation. To this end, we performed cotransfections in HEK293 cells, introducing Myc-LRRK2 wild type (WT) alongside either HA-RAB32 WT or HA-RAB32 S71R. Autophosphorylation of S1292 provides a known in vivo readout of LRRK2 kinase activity, as well as a biomarker for the development of parkinsonian phenotypes in patients carrying LRRK2 mutations^27,29,30. As anticipated, we observed a significant increase in LRRK2 S1292 phosphorylation levels in cells expressing mutant RAB32 versus WT RAB32 (approximately threefold increase, P = 0.02; Fig. 3a and Extended Data Fig. 9). In keeping with this, we also observed a threefold increase in the interaction between LRRK2 and RAB32 through coimmunoprecipitation and western blotting (P = 0.034; Fig. 3b and Extended Data Fig. 9). Finally, previous phosphoproteomic analysis indicated that Ser71 constitutes the only known phosphorylation site on the RAB32 protein (Fig. 4)³¹. The S71R variant probably interferes with this phosphorylation event, and this may provide an explanation for why the S71R variant alone was found to exhibit a robust association with PD risk. Taken together, these results support a model where the S71R variant selectively impacts the functioning of RAB32 in a manner that increases its interaction with LRRK2 and, through this, increases LRRK2 activity.

**Fig. 3: RAB32 S71R triggers increased LRRK2 kinase activity.**

**Fig. 4: Three-dimensional protein structure of RAB32 highlighting the phosphorylation site at Ser71.**

In conclusion, we present, to our knowledge, the largest genetic association study of familial PD so far, identifying p.S71R in RAB32 as a novel risk variant and substantiating that the toxicity of this variant is most probably mediated through enhancement of LRRK2 activity. Importantly, substantial evidence points toward increased LRRK2 kinase activity as the primary pathological mechanism in LRRK2 PD and, to some extent, idiopathic PD, and consequently, reduction of LRRK2 activity has emerged as a major therapeutic focus^5,6,7. Future research is needed to determine the mechanism through which RAB32 S71R increases LRRK2 activity. Of particular interest is the possibility that RAB32 S71R leads to tetramerization of LRRK2, similar to RAB29³², which was shown to increase LRRK2 autophosphorylation in contrast to WT RAB32. Our study underscores that, as with prior work in amyotrophic lateral sclerosis^16,17,18, the combination of familial case ascertainment and a sizable control cohort enables the identification of low-frequency variants that were previously limited to detection by linkage analyses. This is evidenced first and foremost by the discovery of RAB32 p.S71R, but also by the recovery of known ultrarare PD variants that had not previously been detectable in genome-wide case–control analyses (LRRK2 p.R1441C, MAF_ctrl 0.000014, P = 2.27 × 10⁻¹⁰). Interestingly, our findings are corroborated by a recent study³³, which discovered that RAB32 p.S71R segregates with PD in three families, not overlapping with our study. Consistent with our results, this parallel study demonstrates that RAB32 S71R enhances LRRK2 autophosphorylation, as well as RAB10 phosphorylation. This independent study provides complementary evidence that validates our findings, thus firmly establishing RAB32 p.S71R as a causal factor in familial PD. Finally, the identification of RAB32 as a familial PD gene provides new insight into the important mechanistic connections between RAB family biology, LRRK2 and PD risk.

Methods

Cohorts

We received approval for this study from the institutional review boards of the participating centers, and written informed consent was obtained from all patients (consent for research). Informed consent was obtained for the publication of individual case reports. The discovery cohort included 2,824 patients with PD and 78,683 controls, totaling 81,507 study participants, of which 10,270 were subjected to WGS and 71,237 to whole-exome sequencing (WXS). One thousand six hundred familial PD cases were from the PROGENI study, which recruited multiplex PD families primarily through a living affected sibling pair. Subsequently, ascertainment was loosened to include PD probands having a positive family history of PD in a first-degree relative, who was not required to be part of the study. All individuals completed an in-person evaluation that included parts II and III of the Unified Parkinson Disease Rating Scale (Lang and Fahn 1989) and a diagnostic checklist that implemented the UK PD Brain Bank inclusion and exclusion criteria. Responses on the diagnostic checklist were then used to classify study subjects as having verified PD or nonverified PD. A total of 783 familial PD cases were recruited at the Mayo clinic, patients were included who fulfilled the clinical diagnostic criteria of PD³⁵. A total of 441 familial PD cases were from the Parkinson Institute Biobank; diagnosis of PD was made according to the UK Brain Bank criteria³⁵. Control cohorts included 7,323 samples from the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine research program³⁶; 49,981 samples from the UK Biobank³⁷; 2,805 samples from Project MinE³⁸; 342 samples from the New York Genome Center’s amyotrophic lateral sclerosis (ALS) consortium and 18,232 samples from database of Genotypes and Phenotypes (included studies and accession number are listed in the ‘Data availability’ section)³⁹.

Sequencing

All raw sequencing data was aligned to the GRCh38 reference genome using Burrow–Wheeler aligner maximal exact match (BWA–MEM; v.2.2.1)⁴⁰ according to the pipeline described by Regier et al.⁴¹ (implementation can be found at GitHub⁴² and Zenodo⁴³). Joint genotyping was performed using a uniform pipeline according to the Genome Analysis Toolkit (GATK) best practices (v. 4.2.6.1)¹⁹. Genotype calls with a quality score <20 were set to missing, variant calls supported by uninformative reads were excluded and multiallelic variants were split into biallelic variants. Male genotypes in nonpseudoautosomal regions on chromosome X were coded as 0 or 1 (according to 0 or 1 allele copies).

Variant annotation

Variants were annotated using snpEff⁴⁴, dbscSNV⁴⁵ and Ensembl Release 105 gene models⁴⁶. Variants were classified as loss-of-function when predicted by snpEff to have a high impact (including nonsense mutations, splice acceptor/donors and frameshift mutations) or predicted as potentially splice altering by dbscSNV (‘ada’ or ‘rf’ score >0.7). Variants were classified as having moderate impact when predicted as such by snpEff (including missense mutations and inframe deletions). For each gene, the impact of a variant was determined by its most severe consequence across protein-coding transcripts.

Sample quality control

Ancestry was estimated by projecting all samples on a reference ancestry space comprising 1000 Genomes samples using the LASER software (v2.04)⁴⁷. We retained individuals of predominantly European ancestry (Extended Data Figs. 1 and 2). We then excluded samples with low genotype call rates (<0.9), discordant sex or deviating heterozygosity (F < −0.1 or F > 0.1). These metrics were calculated in a set of autosomal variants meeting the following criteria: call rate >0.9 in each supercohort (WGS, WXS_UKB and WXS_other), MAF >0.01, and for sex inference, heterozygosity, relatedness and principal component analysis (PCA) variants were additionally filtered on the basis of Hardy–Weinberg equilibrium (P < 0.001) and pruned if in linkage disequilibrium (r² < 0.5, window size of 50, step 5; additionally, high linkage disequilibrium regions were excluded before PCA⁴⁸). We then excluded samples based on a high exome-wide number of single-nucleotide variants (SNVs), insertions/deletions (INDELs), singletons, high INDEL/SNV ratio or deviating Ti/Tv ratio (thresholds listed in Extended Data Fig. 2). Sample duplicates and relatives up to and including the second degree were identified using the KING software⁴⁹. An unrelated sample set was generated by first excluding samples with five or more relations, followed by iteratively excluding individuals with the highest number of relations, resolving ties by prioritizing (in order) PD over controls and WGS over WXS samples. PCA was performed on the unrelated sample set using fastPCA as implemented in PLINK2 (ref. ⁵⁰). A distinct cluster was identified on the fourth and fifth principal component, consisting of an Amish population, which was excluded since the cluster contained only controls (Extended Data Fig. 2).

Variant quality control

First, GATK variant quality score recalibration (VQSR) was applied to all variants using the training data and annotations as recommended by the GATK best practices¹⁹. Variants were excluded if they did not pass variant quality score recalibration, their genotyping rate was <0.9 in any of the supercohorts (WGS, WXS_UKB and WXS_other) or if they did not pass the Hardy–Weinberg equilibrium test in control subjects (P < 0.0001). We then additionally excluded variants with subpar quality scores, variants located in regions showing signs of batch effects and we retained only SNVs. Potential batch effects were identified by testing whether variant minor allele counts were associated with cohort membership within control subjects. Firth’s logistic regression was used to perform these control–control analyses, adjusting for sex and four principal components. This procedure was repeated for each cohort (that is, number 1 represents individual in respective cohort, number 0 represents otherwise). In total, 16 control versus control cohort comparisons were tested (including all WGS controls versus all WXS controls). The minimum P value across these analyses was used as a metric to identify variants associating with probable batch effects. The stringency of various standard variant quality control filters was then increased to eliminate variants exhibiting batch associated calling bias while maintaining maximal sensitivity for unbiased variant calls (Extended Data Fig. 4). In total 5,044,315 variants passed quality control, including 2,114,963 nonsynonymous variants (Extended Data Fig. 4).

Single-variant analyses

Single-variant analyses were performed for all nonsynonymous variants with MAF <0.05 and a minor allele count of at least five (425,827 variants). For each variant, we tested for an association between PD status and minor allele count using Firth’s logistic regression adjusting for sex, the first ten principal components and the total number of rare synonymous variants in each individual²⁰. All tests were two sided and Bonferroni correction was employed to correct for multiple testing. Significant single-variant associations were screened for potential technical biases arising from different sequencing centers among cases by using the same procedure employed in the control–control analyses. Variants where P_case–case < P_{case–control} were flagged as being potentially driven by technical variation (Extended Data Fig. 5). In addition, we performed Sanger sequencing to validate novel identified variants (see ‘Sanger sequencing’ section). Haplotypes were inferred using Beagle (v.5.4) in single-nucleotide polymorphism (SNP) array data (Infinium Global Diversity Array-8)⁵¹.

Gene-based analyses

Gene-based rare variant analyses were performed for 18,242 protein-coding genes, including either low frequency (MAF <0.05) or ultrarare (≤5 carriers) nonsynonymous variants (loss-of-function or moderate impact). For ultrarare variants, we performed burden tests by testing for an association between PD status and the aggregate effect of minor alleles observed per sample per gene. For low-frequency variants (MAF <0.05), we additionally performed SKAT and ACAT-V tests^22,52. To account for the unbalanced case–control ratio, burden tests were performed using Firth’s logistic regression²⁰, the robust version of SKAT was employed²¹ and we adapted ACAT-V so that Firth’s logistic regression is used to perform the burden and single-variant tests that underlie ACAT-V⁵³. Test statistics across the three statistical tests were combined using ACAT, which is designed to combine potentially correlated test statistics²². In each of the tests, we included sex, the first ten principal components and the total number of qualifying synonymous variants in each individual as covariates. Tests were retained when there were at least five variant carriers across the gene.

We performed conditional analyses to assess to what extent gene-based associations were driven by significant variants identified in the single-variant analysis. Conditional gene-based analyses were performed as described above, excluding and adjusting for the significant single variants within the respective gene.

Sanger sequencing

Sanger sequencing of RAB32 exon 1 and SNAPC1 c.1073-2A > T was performed using touchdown PCR (30 cycles with annealing temperature starting at 65 °C and decreasing 0.5 °C per cycle, followed by 15 cycles with an annealing temperature of 65 °C) using primers generated with M13 tails. Amplification was performed using AmpliTaqGold 360 Master mix (Thermo Fisher Scientific, 4398876). Patient DNA samples were provided after extraction from whole blood. PCR products were visualized on a 2% agarose gel using gel electrophoresis and subsequently purified by incubation with Exonuclease I (New England Biolabs, M0293L) and shrimp alkaline phosphatase (Sigma-Aldrich, GEE70092X). Following purification, all PCR products were sequenced at the MGH Massachusetts General Hospital DNA Core Facility. Sequence analysis was performed using the PHRED/PHRAP/Consed software suite (http://www.phrap.org/) and variations in the sequences were identified with the Polyphred v6.15 software⁵⁴. The PCR and sequencing primers are listed in Supplementary Tables 1 and 2. Primer set 1a in Supplementary Table 1 is for one sample (sample 17 in Extended Data Table 1) that has a primer sitting on top of the common SNP, rs41285869. New primers were designed to avoid this common SNP as a false homozygous variant was visualized as compared with the exome sequencing results.

Immunoprecipitation

Immortalized cell culture

HEK293 cells (ATCC) were maintained at 37 °C with 5% CO₂ in Dulbecco’s modified Eagle medium (Invitrogen) supplemented with 10% (vol/vol) heat inactivated fetal bovine serum (MediaTech), 2 mM l-glutamine (Gibco) and 1% (vol/vol) penicillin and streptomycin solution (Gibco).

Plasmids used for this study

The 2xMyc-LRRK2 WT plasmid was acquired from Addgene (number 25361) and confirmed via whole plasmid sequencing at Massachusetts General Hospital. The HA-RAB32 constructs, WT and S71R, were made and sequenced by Genescript. All constructs were prepped with a Maxi prep kit (Qiagen) following the manufacturer’s instructions.

Immunoprecipitation and western blot

HEK293 cells were transfected with Lipofectamine 2000 (Invitrogen) according to the manufacturer’s instructions using a DNA:Lipo ratio of 3 μg:6 μl per well of a six-well plate. For cotransfections, the DNA amount was divided at a 2:1 ratio for LRRK2 WT:RAB32 constructs. A total of 48 h posttransfection, the lysates were collected in denaturing hypotonic lysis buffer⁵⁵ and protein concentration determined by Pierce BCA Protein Assay (Thermo Fisher). The 1× PBS + 0.1% Tween 20 washed Myc–antibody conjugated magnetic beads were combined with lysate in a 1 μl:5 μg ratio before rotating the immunoprecipitation overnight at 4 °C. The beads were washed with denaturing wash buffer⁵⁵ before protein elution by boiling in 2× loading sample buffer (Boston Bioproducts). Approximately 50% of the eluate from each condition was run on a 4–20% gradient gel (Bio-Rad) and transferred to a polyvinylidene difluoride membrane (Bio-Rad) on the high molecular weight setting of a Trans-blot Turbo system (Bio-Rad). The membranes were processed as described previously⁵⁶. Primary antibodies were used as follows: 1:2,000 for rabbit anti-HA (DSHB, catalog number anti-HA rRb-IgG), 1:1,000 mouse anti-Myc (BioLegend, catalog number 626802) and 1:300 Phos-LRRK2 S1292 (Abcam, catalog number 203181). Blots were visualized with an Odyssey Infrared Imager (LiCor, model 9120).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

Exome sequencing data from the case-cohort is available under dbGaP study phs001004 or are available via the corresponding author subject to the data sharing terms of the respective cohorts. Project MinE data are available upon request (https://www.projectmine.com/research/data-sharing/). dbGAP datasets used are available under the following accession numbers: Alzheimer’s Disease Sequencing Project (ADSP) (phs000572), Autism Sequencing Consortium (ASC) (phs000298), Sweden-Schizophrenia Population-Based Case–Control Exome Sequencing (phs000473), Inflammatory Bowel Disease Exome Sequencing Study (phs001076), Myocardial Infarction Genetics Exome Sequencing Consortium: Ottawa Heart Study (phs000806), Myocardial Infarction Genetics Exome Sequencing Consortium: Malmo Diet and Cancer Study (phs001101), Myocardial Infarction Genetics Exome Sequencing Consortium: University of Leicester (phs001000), Myocardial Infarction Genetics Exome Sequencing Consortium: Italian Atherosclerosis Thrombosis and Vascular Biology (phs000814), NHLBI GO-ESP: Women’s Health Initiative Exome Sequencing Project (WHI)—WHISP (phs000281), Building on GWAS for NHLBI diseases: the US CHARGE Consortium (CHARGE-S)—CHS (phs000667), Building on GWAS for NHLBI Diseases: the US CHARGE Consortium (CHARGE-S)—ARIC (phs000668), Building on GWAS for NHLBI diseases: the US CHARGE Consortium (CHARGE-S)—FHS (phs000651), NHLBI GO-ESP Family Studies: Idiopathic Bronchiectasis (phs000518), NHLBI GO-ESP: Family Studies (Hematological Cancers) (phs000632), NHLBI GO-ESP: Family Studies (familial atrial fibrillation) (phs000362), NHLBI GO-ESP: Heart Cohorts Exome Sequencing Project (ARIC) (phs000398), NHLBI GO-ESP: Heart Cohorts Exome Sequencing Project (CHS) (phs000400), NHLBI GO-ESP: Heart Cohorts Exome Sequencing Project (FHS) (phs000401), NHLBI GO-ESP: Lung Cohorts Exome Sequencing Project (asthma) (phs000422), NHLBI GO-ESP: Lung Cohorts Exome Sequencing Project (COPDGene) (phs000296), GO-ESP: Family Studies (thoracic aortic aneurysms leading to acute aortic dissections) (phs000347), NHLBI TOPMed: Genomic Activities such as Whole Genome Sequencing and Related Phenotypes in the Framingham Heart Study (phs000974), NHLBI TOPMed: Genetics of Cardiometabolic Health in the Amish (phs000956), NHLBI TOPMed: Genetic Epidemiology of COPD (COPDGene) (phs000951), NHLBI TOPMed: The Vanderbilt Atrial Fibrillation Registry (VU_AF) (phs001032), NHLBI TOPMed: Cleveland Clinic Atrial Fibrillation (CCAF) Study (phs001189), NHLBI TOPMed: Partners HealthCare Biobank (phs001024), NHLBI TOPMed—NHGRI CCDG: Massachusetts General Hospital (MGH) Atrial Fibrillation Study (phs001062), NHLBI TOPMed: Novel Risk Factors for the Development of Atrial Fibrillation in Women (phs001040), NHLBI TOPMed—NHGRI CCDG: The Vanderbilt AF Ablation Registry (phs000997), NHLBI TOPMed: Heart and Vascular Health Study (HVH) (phs000993), NHLBI TOPMed—NHGRI CCDG: Atherosclerosis Risk in Communities (ARIC) (phs001211), NHLBI TOPMed: The Genetics and Epidemiology of Asthma in Barbados (phs001143), NHLBI TOPMed: Women’s Health Initiative (WHI) (phs001237), NHLBI TOPMed: Whole Genome Sequencing of Venous Thromboembolism (WGS of VTE) (phs001402), and NHLBI TOPMed: Trans-Omics for Precision Medicine (TOPMed) Whole Genome Sequencing Project—Cardiovascular Health Study (phs001368). Additional data used include: GATK hg38 resource bundle (https://console.cloud.google.com/storage/browser/genomics-public-data/resources/broad/hg38/v0/), BWA–MEM GRCh38 reference genome (run ‘bwa.kit/run-gen-ref hs38DH’ from https://github.com/lh3/bwa/tree/master/bwakit), 1000 Genomes phase 3 (ftp://ftp-trace.ncbi.nih.gov/1000genomes/ftp/release/20110521), Ensembl v.105 (https://ftp.ensembl.org/pub/release-105/gtf/homo_sapiens/Homo_sapiens.GRCh38.105.gtf.gz), gnomAD v.2.1.1 (non-neuro) (https://gnomad.broadinstitute.org/variant/6-146865220-C-G?dataset=gnomad_r2_1_non_neuro), and Accelerating Medicines Partnership (AMP) program for PD (https://amp-pd.org/). Source data are provided with this paper.

Code availability

All raw sequencing data were aligned to the GRCh38 reference genome using BWA–MEM (v.2.2.1) according to the pipeline described by Regier et al. (implementation can be found via GitHub at https://github.com/maarten-k/realignment (ref. ⁴²) and via Zenodo at https://doi.org/10.5281/zenodo.10963076 (ref. ⁴³). Joint genotyping was performed using a uniform pipeline according to the GATK best practices (v. 4.2.6.1). Handling and filtering of VCF files was performed using VCFtools (v. 0.1.16), BCFtools (v. 1.9) and PLINK (v. 1.90b6.21). Ancestry was estimated using LASER (v.2.04). Variants were annotated using Ensembl (GRCh38.105), snpEff (v.5.1d) and dbNSFP (v. 4.3a). Sample and variant quality control was performed using PLINK (v. 1.90b6.21) and RVAT (v. 0.2.0), while sample relatedness was inferred using KING (v. 2.2.7). All downstream analyses were performed using custom R code (performed in R 3.6.3) that we made available in the RVAT R package (v. 0.2.0) available via GitHub at https://github.com/kennalab/rvat (ref. ⁵³) and via Zenodo at https://doi.org/10.5281/zenodo.10973472 (ref. ⁵⁷). Other R packages used either as dependencies of RVAT or in other analyses and visualizations are ggplot2 (v. 3.4.2), ggrepel (v. 0.9.1), dplyr (v. 1.0.7), readr (v. 2.1.1), stringr (v. 1.4.0), tidyr (v. 1.1.4), magrittr (v. 2.0.1), kinship2 (v. 1.9.6), logistf (v. 1.25.0), SKAT (v. 2.2.5), SummarizedExperiment (v. 1.16.1), S4Vectors (v. 0.24.4), GenomicRanges (v. 1.38.0), IRanges (v. 2.20.2), libraDBI (v. 1.1.3) and RSQLite (v. 2.3.1). Sequence analysis for Sanger sequencing was performed using the v.29.0 PHRED/PHRAP/Consed software suite (http://www.phrap.org/) and variations in the sequences were identified via the Polyphred v6.15 software at http://droog.gs.washington.edu/polyphred/) (ref. ⁵⁴).

References

Blauwendraat, C., Nalls, M. A. & Singleton, A. B. The genetic architecture of Parkinson’s disease. Lancet Neurol. 19, 170–178 (2020).
CAS PubMed Google Scholar
Rehm, H. L. et al. ClinGen—the clinical genome resource. N. Engl. J. Med. 372, 2235–2242 (2015).
CAS PubMed PubMed Central Google Scholar
Waschbüsch, D. et al. LRRK2 transport is regulated by its novel interacting partner Rab32. PLoS ONE 9, e111632 (2014).
PubMed PubMed Central Google Scholar
McGrath, E., Waschbüsch, D., Baker, B. M. & Khan, A. R. LRRK2 binds to the Rab32 subfamily in a GTP-dependent manner via its armadillo domain. Small GTPases 12, 133–146 (2021).
CAS PubMed Google Scholar
Berwick, D. C., Heaton, G. R., Azeggagh, S. & Harvey, K. LRRK2 biology from structure to dysfunction: research progresses, but the themes remain the same. Mol. Neurodegener. 14, 49 (2019).
CAS PubMed PubMed Central Google Scholar
Tolosa, E., Vila, M., Klein, C. & Rascol, O. LRRK2 in Parkinson disease: challenges of clinical trials. Nat. Rev. Neurol. 16, 97–107 (2020).
PubMed Google Scholar
Taymans, J.-M. et al. Perspective on the current state of the LRRK2 field. NPJ Park. Dis. 9, 104 (2023).
CAS Google Scholar
Shino, M. Y. et al. Familial aggregation of Parkinson’s disease in a multiethnic community-based case-control study. Mov. Disord. 25, 2587–2594 (2010).
PubMed PubMed Central Google Scholar
Farlow, J. L. et al. Whole-exome sequencing in familial Parkinson disease. JAMA Neurol. 73, 68 (2016).
PubMed PubMed Central Google Scholar
Hernandez, D. G., Reed, X. & Singleton, A. B. Genetics in Parkinson disease: Mendelian versus non-Mendelian inheritance. J. Neurochem. 139, 59–74 (2016).
CAS PubMed PubMed Central Google Scholar
Jankovic, J. & Tan, E. K. Parkinson’s disease: etiopathogenesis and treatment. J. Neurol. Neurosurg. Psychiatry 91, 795–808 (2020).
PubMed Google Scholar
Bandres-Ciga, S., Diez-Fairen, M., Kim, J. J. & Singleton, A. B. Genetics of Parkinson’s disease: an introspection of its journey towards precision medicine. Neurobiol. Dis. 137, 104782 (2020).
CAS PubMed PubMed Central Google Scholar
Parkinson’s disease gene curation expert panel. ClinGen https://clinicalgenome.org/affiliation/40079/ (2024).
Povysil, G. et al. Rare-variant collapsing analyses for complex traits: guidelines and applications. Nat. Rev. Genet. 20, 747–759 (2019).
CAS PubMed Google Scholar
Makarious, M. B. et al. Large-scale rare variant burden testing in Parkinson’s disease. Brain 146, 4622–4632 (2023).
PubMed PubMed Central Google Scholar
Smith, B. N. et al. Exome-wide rare variant analysis identifies TUBA4A mutations associated with familial ALS. Neuron 84, 324–331 (2014).
CAS PubMed PubMed Central Google Scholar
Kenna, K. P. et al. NEK1 variants confer susceptibility to amyotrophic lateral sclerosis. Nat. Genet. 48, 1037–1042 (2016).
CAS PubMed PubMed Central Google Scholar
Nicolas, A. et al. Genome-wide analyses identify KIF5A as a novel ALS gene. Neuron 97, 1268–1283 (2018).
CAS PubMed Central Google Scholar
Van der Auwera, G. D. & O’Connor, B. D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (O’Reilly Media, Inc., 2020).
Firth, D. Bias reduction of maximum likelihood estimates. Biometrika 80, 27–38 (1993).
Google Scholar
Zhao, Z. et al. UK Biobank whole-exome sequence binary phenome analysis with robust region-based rare-variant test. Am. J. Hum. Genet. 106, 3–12 (2020).
CAS PubMed Google Scholar
Liu, Y. et al. ACAT: a fast and powerful P value combination method for rare-variant analysis in sequencing studies. Am. J. Hum. Genet. 104, 410–421 (2019).
CAS PubMed PubMed Central Google Scholar
Genome Aggregation Database Consortium et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).
Google Scholar
Lara Ordóñez, A. J., Fasiczka, R., Naaldijk, Y. & Hilfiker, S. Rab GTPases in Parkinson’s disease: a primer. Essays Biochem. 65, 961–974 (2021).
PubMed PubMed Central Google Scholar
Steger, M. et al. Systematic proteomic analysis of LRRK2-mediated Rab GTPase phosphorylation establishes a connection to ciliogenesis. eLife 6, e31012 (2017).
PubMed PubMed Central Google Scholar
Purlyte, E. et al. Rab29 activation of the Parkinson’s disease-associated LRRK2 kinase. EMBO J. 37, 1–18 (2018).
CAS PubMed Google Scholar
Di Maio, R. et al. LRRK2 activation in idiopathic Parkinson’s disease. Sci. Transl. Med. 10, eaar5429 (2018).
PubMed PubMed Central Google Scholar
Kalogeropulou, A. F. et al. Impact of 100 LRRK2 variants linked to Parkinson’s disease on kinase activity and microtubule binding. Biochem. J. 479, 1759–1783 (2022).
CAS PubMed Google Scholar
Sheng, Z. et al. Ser¹²⁹² autophosphorylation is an indicator of LRRK2 kinase activity and contributes to the cellular effects of PD mutations. Sci. Transl. Med. 4, 164ra161 (2012).
PubMed Google Scholar
Fraser, K. B. et al. Ser(P)-1292 LRRK2 in urinary exosomes is elevated in idiopathic Parkinson’s disease. Mov. Disord. 31, 1543–1550 (2016).
CAS PubMed PubMed Central Google Scholar
Zhou, H. et al. Toward a comprehensive characterization of a human cancer cell phosphoproteome. J. Proteome Res. 12, 260–271 (2013).
CAS PubMed Google Scholar
Zhu, H. et al. Rab29-dependent asymmetrical activation of leucine-rich repeat kinase 2. Science 382, 1404–1411 (2023).
CAS PubMed PubMed Central Google Scholar
Gustavsson, E. K. et al. RAB32 Ser71Arg in autosomal dominant Parkinson’s disease: linkage, association, and functional analyses. Lancet Neurol. 23, 603–614 (2024).
Khan, A. R., Kecman, T. PDB entry - 6FF8. Crystal structure of uncomplexed Rab32 in the active GTP-bound state at 2.13 angstrom resolution. Protein Data Bank https://doi.org/10.2210/pdb6ff8/pdb (2024).
Hughes, A. J., Daniel, S. E., Kilford, L. & Lees, A. J. Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry 55, 181–184 (1992).
CAS PubMed PubMed Central Google Scholar
Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed program. Nature 590, 290–299 (2021).
CAS PubMed PubMed Central Google Scholar
Sudlow, C. et al. UK Biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 12, e1001779 (2015).
PubMed PubMed Central Google Scholar
Project MinE ALS Sequencing Consortium. Project MinE: study design and pilot analyses of a large-scale whole-genome sequencing study in amyotrophic lateral sclerosis. Eur. J. Hum. Genet. 26, 1537–1546 (2018).
Google Scholar
Tryka, K. A. et al. NCBI’s database of genotypes and phenotypes: dbGaP. Nucleic Acids Res. 42, D975–D979 (2014).
CAS PubMed Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
CAS PubMed PubMed Central Google Scholar
Regier, A. A. et al. Functional equivalence of genome sequencing analysis pipelines enables harmonized variant calling across human genetics projects. Nat. Commun. 9, 4038 (2018).
PubMed PubMed Central Google Scholar
realignment. GitHub https://github.com/maarten-k/realignment (2023).
Kooyman, M. maarten-k/realignment: pipeline for exome and WGS(DF3) pipeline. Zenodo https://doi.org/10.5281/zenodo.10963076 (2024).
Cingolani, P. et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w¹¹¹⁸; iso-2; iso-3. Fly 6, 80–92 (2012).
CAS PubMed PubMed Central Google Scholar
Jian, X., Boerwinkle, E. & Liu, X. In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res. 42, 13534–13544 (2014).
CAS PubMed PubMed Central Google Scholar
Zerbino, D. R. et al. Ensembl 2018. Nucleic Acids Res. 46, D754–D761 (2018).
CAS PubMed Google Scholar
Wang, C., Zhan, X., Liang, L., Abecasis, G. R. & Lin, X. Improved ancestry estimation for both genotyping and sequencing data using projection procrustes analysis and genotype imputation. Am. J. Hum. Genet. 96, 926–937 (2015).
CAS PubMed PubMed Central Google Scholar
Anderson, C. A. et al. Data quality control in genetic case–control association studies. Nat. Protoc. 5, 1564–1573 (2010).
CAS PubMed PubMed Central Google Scholar
Manichaikul, A. et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010).
CAS PubMed PubMed Central Google Scholar
Galinsky, K. J. et al. Fast principal component analysis reveals convergent evolution of ADH1B in Europe and East Asia. Am. J. Hum. Genet. 98, 456–472 (2016).
CAS PubMed PubMed Central Google Scholar
Browning, B. L., Tian, X., Zhou, Y. & Browning, S. R. Fast two-stage phasing of large-scale sequence data. Am. J. Hum. Genet. 108, 1880–1890 (2021).
CAS PubMed PubMed Central Google Scholar
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am. J. Hum. Genet. 89, 82–93 (2011).
CAS PubMed PubMed Central Google Scholar
Hop, P. J. & Kenna, K. P. RVAT: rare variant association toolkit. GitHub https://github.com/kennalab/rvat (2024).
Nickerson, D. A., Tobe, V. O. & Taylor, S. L. PolyPhred: automating the detection and genotyping of single nucleotide substitutions using fluorescence-based resequencing. Nucleic Acids Res. 25, 2745–2751 (1997).
CAS PubMed PubMed Central Google Scholar
Singh, G., Ricci, E. P. & Moore, M. J. RIPiT-seq: a high-throughput approach for footprinting RNA:protein complexes. Methods 65, 320–332 (2014).
CAS PubMed Google Scholar
Baron, D. M. et al. ALS-associated KIF5A mutations abolish autoinhibition resulting in a toxic gain of function. Cell Rep. 39, 110598 (2022).
CAS PubMed PubMed Central Google Scholar
KennaLab/rvat: v.2.09. Zenodo https://doi.org/10.5281/zenodo.10973472 (2024).

Download references

Acknowledgements

We acknowledge S. Goldwurm for his contributions to this work. This research has been conducted using the UK Biobank Resource under application number 48361. Grants from National Institutes of Health (NIH)/National Institute of Neurological Disorders and Stroke (NINDS) R01NS096740 and R56NS082349 to J.E.L. and T.M.F. Grant from NIH/NINDS: R61NS130604 was given to J.E.L. and from NIH/NINDS: R01NS037167 was given to T.M.F. K.K. is supported by grants from the Dutch Research Council (grant no. ZonMW-VIDI 91719350) and the ALS Foundation Netherlands. All New York Genome Center ALS Consortium activities are supported by the ALS Association (ALSA, 19-SI-459) and the Tow Foundation. The Target ALS Human Postmortem Tissue Core, New York Genome Center for Genomics of Neurodegenerative Disease, Amyotrophic Lateral Sclerosis Association and Tow Foundation. Z.K.W. is partially supported by the NIH/National Institute on Aging and NIH/NINDS (1U19AG063911, FAIN: U19AG063911), Mayo Clinic Center for Regenerative Medicine, the gifts from the Donald G. and Jodi P. Heeringa Family, the Haworth Family Professorship in Neurodegenerative Diseases fund, The Albertson Parkinson’s Research Foundation and PPND Family Foundation. O.A.R. is supported in part by NIH (P50-NS072187; R01-NS078086; U54-NS100693; U54-NS110435), DOD (W81XWH-17-1-0249), The Michael J. Fox Foundation, The Little Family Foundation, T. Turner and family, the Mayo Clinic Foundation, and the Center for Individualized Medicine. This study was supported by the generous contribution of the ‘Fondazione Grigioni per il Morbo di Parkinson’, Milan (Italy), a charitable association linked to AIP (the ‘Italian Association of Parkinsonians’; http://www.parkinson.it/fondazione-grigioni.html). The ‘Fondazione Grigioni per il Morbo di Parkinson’ paid part of lab expenses and the salary of L.S. DNA samples belong to the ‘Parkinson Institute Biobank’, member of the Telethon Network of Genetic Biobanks (http://biobanknetwork.telethon.it). This project has received funding from the European Research Council under the European Union’s Horizon 2020 Research and Innovation programme (grant agreement no. 772376–EScORIAL). Data from the New York Genome Center ALS consortium were used. All consortium members are listed in the Supplementary Note.

Author information

These authors contributed equally: Paul J. Hop, Dongbing Lai.
These authors jointly supervised this work: Tatiana M. Foroud, Kevin P. Kenna, John E. Landers.
A full list of members and their affiliations appears in the Supplementary Information.

Authors and Affiliations

Department of Translational Neuroscience, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
Paul J. Hop & Kevin P. Kenna
Department of Neurology, UMC Utrecht Brain Center, University Medical Center Utrecht, Utrecht, the Netherlands
Paul J. Hop, Brendan J. Kenna, Maarten Kooyman & Jan H. Veldink
Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
Dongbing Lai, Cheryl Halter & Tatiana M. Foroud
Department of Neurology, UMass Chan Medical School, Worcester, MA, USA
Pamela J. Keagle, Desiree M. Baron, Shankaracharya & John E. Landers
Department of Biomedical Sciences, Humanitas University, Milan, Italy
Letizia Straniero & Rosanna Asselta
IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy
Letizia Straniero & Rosanna Asselta
Parkinson Institute, ASST Gaetano Pini-CTO, Milan, Italy
Salvatore Bonvegna, Ioannis Ugo Isaias & Gianni Pezzoli
Department of Neuroscience, Mayo Clinic, Jacksonville, FL, USA
Alexandra I. Soto-Beasley & Owen A. Ross
Department of Neurology, Mayo Clinic, Jacksonville, FL, USA
Zbigniew K. Wszolek & Ryan J. Uitti
Department of Neurology, University Hospital of Würzburg and Julius Maximilian University of Würzburg, Würzburg, Germany
Ioannis Ugo Isaias
Fondazione Grigioni per il Morbo di Parkinson, Milan, Italy
Gianni Pezzoli
Department of Neurology–Stroke Unit and Laboratory of Neuroscience, Istituto Auxologico Italiano IRCCS, Milan, Italy
Nicola Ticozzi
Department of Pathophysiology and Transplantation, ‘Dino Ferrari’ Center, Università degli Studi di Milano, Milan, Italy
Nicola Ticozzi
Department of Clinical Genomics, Mayo Clinic, Jacksonville, FL, USA
Owen A. Ross

Authors

Paul J. Hop
View author publications
You can also search for this author inPubMed Google Scholar
Dongbing Lai
View author publications
You can also search for this author inPubMed Google Scholar
Pamela J. Keagle
View author publications
You can also search for this author inPubMed Google Scholar
Desiree M. Baron
View author publications
You can also search for this author inPubMed Google Scholar
Brendan J. Kenna
View author publications
You can also search for this author inPubMed Google Scholar
Maarten Kooyman
View author publications
You can also search for this author inPubMed Google Scholar
Shankaracharya
View author publications
You can also search for this author inPubMed Google Scholar
Cheryl Halter
View author publications
You can also search for this author inPubMed Google Scholar
Letizia Straniero
View author publications
You can also search for this author inPubMed Google Scholar
Rosanna Asselta
View author publications
You can also search for this author inPubMed Google Scholar
Salvatore Bonvegna
View author publications
You can also search for this author inPubMed Google Scholar
Alexandra I. Soto-Beasley
View author publications
You can also search for this author inPubMed Google Scholar
Zbigniew K. Wszolek
View author publications
You can also search for this author inPubMed Google Scholar
Ryan J. Uitti
View author publications
You can also search for this author inPubMed Google Scholar
Ioannis Ugo Isaias
View author publications
You can also search for this author inPubMed Google Scholar
Gianni Pezzoli
View author publications
You can also search for this author inPubMed Google Scholar
Nicola Ticozzi
View author publications
You can also search for this author inPubMed Google Scholar
Owen A. Ross
View author publications
You can also search for this author inPubMed Google Scholar
Jan H. Veldink
View author publications
You can also search for this author inPubMed Google Scholar
Tatiana M. Foroud
View author publications
You can also search for this author inPubMed Google Scholar
Kevin P. Kenna
View author publications
You can also search for this author inPubMed Google Scholar
John E. Landers
View author publications
You can also search for this author inPubMed Google Scholar

Consortia

Project MinE ALS Sequencing Consortium

Nicola Ticozzi
, Jan H. Veldink
& John E. Landers

Contributions

Experiments and data analysis were performed by P.J.H., D.L., P.J.K., D.M.B., T.M.F., K.P.K. and J.E.L. Sample ascertainment and data generation was carried out by P.J.H., D.L., P.J.K., D.M.B., B.J.K., M.K., S., C.H., L.S., R.A., S.B., A.I.S.-B., Z.K.W., R.J.U., I.U.I., G.P., N.T., O.A.R., J.H.V., T.M.F., K.P.K. and J.E.L. Writing of the paper was performed by P.J.H., D.L., T.M.F., K.P.K. and J.E.L. Study supervision was carried out by T.M.F., K.P.K. and J.E.L. All authors have seen and approved the paper.

Corresponding author

Correspondence to John E. Landers.

Ethics declarations

Competing interests

Z.K.W. serves as principal investigator (PI) or co-PI on Biohaven Pharmaceuticals, Inc. (BHV4157-206) and Vigil Neuroscience, Inc. (VGL101-01.002, VGL101-01.201, positron emission tomography tracer development protocol, Csf1r biomarker and repository project, and ultrahigh field magnetic resonance imaging in the diagnosis and management of CSF1R-related adult-onset leukoencephalopathy with axonal spheroids and pigmented glia) projects/grants. He serves as co-PI of the Mayo Clinic APDA Center for Advanced Research and as an external advisory board member for the Vigil Neuroscience, Inc., and as a consultant on neurodegenerative medical research for Eli Lilli and Company. J.H.V. reports to have sponsored research agreements with Biogen and Astra Zeneca. J.E.L. is a consultant for Illios Therapeutics, WCG Clinical and Biogen. The other authors declare no competing interests.

Peer review

Peer review information

Nature Genetics thanks Roy Alcalay, Henry Houlden and the other, anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Overview of the sample quality control (QC) steps leading to the final cohort of 2,184 individuals with PD and 69,775 controls.

For each filter, the number of samples retained after applying the filter is shown. First, samples of broad European ancestry were retained, then samples were excluded in case of low call rate (<0.9), outlying heterozygosity rate (F < − 0.1 or F > 0.1), genetically predicted sex that did not match reported sex, a deviant number of SNVs, INDELs, singletons or outlying Ti/Tv or SNV/INDEL ratio, ≤2^nd degree relatedness (one of each pair is kept) or outlying values on the first five principal components.

Extended Data Fig. 2 Sample QC.

a,b, Projection of all samples onto PCA coordinates of a reference ancestry space comprising 1000 Genomes samples. Grey dots indicate the 81,507 samples included in this study. Colored dots indicate the 1000 Genomes samples, colored by superpopulation label, onto which the study samples were projected. c, Sample call-rate distribution. Samples with a call-rate < 0.9 were excluded. d, Heterozygosity. Samples with F < − 0.1 or F > 0.1 were excluded. e, X-chromosome homozygosity. Samples with ambiguous sex (0.4 < F < 0.6) or where genetically predicted sex did not match reported sex were excluded (F < 0.4 = female; F > 0.6 = male). f, Principal component analysis (PCA). A distinct cluster was identified on the fifth principal component, samples with PC5 < −0.02 were excluded. g, Total variant count distributions. Samples were excluded if nSNV > 4000, nINDEL > 400, nSNV (Singletons) > 250, nINDEL (Singletons) > 100, Ti/Tv ratio < 2.4 or > 3.7 or INDEL/SNV ratio > 0.165.

Extended Data Fig. 3 Principal components analysis of final analysis cohort consisting of 2,184 individuals with PD and 69,775 controls.

a, Principal component 1 and 2. b, Principal component 3 and 4.

Extended Data Fig. 4 Variant quality control.

a–d, Calibration of quality score thresholds for variants that pass basic quality control (VQSR-pass, per-supercohort call-rate > 0.9, HWE equilibrium P-value in controls > 0.0001). Plots show the cumulative fraction of variants split by case-control P-value (two-tailed; Firth’s logistic regression). Thresholds were set at inflection points so that the majority of variants were retained while excluding a high proportion of variants strongly associated with case-control status. The following thresholds were applied: QD ≥ 6, MQRankSum ≥ −0.5, FS ≤ 14, and control-control minimum P-value ≥ 0.0001 (two-tailed; Firth’s logistic regression). e, Overview of variant QC steps. The first row shows the total number of called variants, the second row shows the number of variants passing basic QC, and the third row shows the number of variants passing strict QC. The strict QC variant filters were used in all analyses presented in this manuscript.

Extended Data Fig. 5 Rare variant analyses.

a, Quantile-quantile (qq) plot of observed gene two-tailed −log₁₀ (P-values) versus expected two-tailed −log₁₀ (P-values) under the null model. Gene-based associations of low-frequency (MAF < 0.05) variants were estimated using the ACAT omnibus test including Firth’s logistic regression, SKAT and ACAT-V. Gene-based associations of ultra-rare variants (≤5 carriers) were estimated using Firth’s logistic regression. b, Gene-based analysis of ultra-rare (≤5 carriers) non-synonymous variants. The y-axis shows the gene-based associations (two-tailed −log₁₀(P-value)) plotted against genomic coordinates on the x-axis (GRCh38). The dashed line indicates exome-wide significance (P = 2.74 × 10⁻⁶). c, Quantile-quantile (qq) plot of observed single variant two-tailed −log₁₀ (P-values) versus expected two-tailed −log₁₀ (P-values) under the null model. The red dotted line indicates the exome-wide significance threshold (P = 1.17 × 10⁻⁷). λ indicates the observed genomic inflation factor, λ₁₀₀₀ indicates the genomic inflation factor for an equivalent study of 1,000 cases and 1,000 controls. d, The y-axis shows the single variant associations estimated using Firth’s logistic regression (two-tailed −log₁₀(P-value)) plotted against genomic coordinates on the x-axis (GRCh38). The dashed line indicates the exome-wide significance threshold (P = 1.17 × 10⁻⁷). e,f, Significant genes and single variants were screened for technical biases arising from different sequencing centers by testing for an association with sequencing center (n = 1,048 and n = 1,136) among PD cases. The y-axis shows the two-tailed −log₁₀ (P-values) from this case-case analysis compared to the two-tailed −log₁₀ (P-values) from the case-control analysis. Genes and single variants were excluded if P_case-case ≤ P_case-con. In both gene-based (e) and single-variant analyses (f), technical bias was observed in the C5 gene. This was caused by one variant (C5; c.3127 C > A), of which all 44 carriers were sequenced in the same sequencing center.

Extended Data Fig. 6 Pedigrees of families with one sequenced individual.

Unaffected family members are indicated by white symbols, affected family members with verified PD are indicated by black symbols, family members with non-verified or reported PD are indicated by grey symbols. For individuals with PD, age of onset is displayed at the left and age at death or last follow-up is displayed on the right (separated by a ‘|’). For non-affected individuals, the age at death or last follow-up is displayed.

Extended Data Fig. 7 Shared haplotype for RAB32 p.S71R.

The figure shows the shared haplotype in a ± 250-kb window surrounding the p.S71R variant based on common markers on the Infinium Global Diversity Array-8 (available for 16 out of 18 p.S71R carriers). The y-axis shows SNP IDs with the distance to the p.S71R variant in parentheses, and the x-axis shows the p.S71R genotype (both alleles are shown for the homozygous carrier). The minimal common region extends from -131,546 bp downstream to 182,121 bp upstream of the p.S71R variant.

Extended Data Fig. 8 Age of onset (AOO) distribution of RAB32 p.S71R carriers versus RAB32 wild-type patients.

Age at onset was known for 1,414 unrelated individuals. The box edges represent the interquartile range (75^th and 25^th percentile) with the horizontal line indicating the median. The whiskers extend to 1.5 times the IQR from the box edges. Individual data points beyond the ends of the whiskers are shown for RAB32 wild-type (WT); all individual data points are shown for RAB32 S71R. No statistically significant difference was observed in a linear regression adjusting for cohort (b = −2.7 years, two-tailed P = 0.38).

Extended Data Fig. 9 Western blots for all biological replicates.

a,b, Western blots for all three biological replicates of LRRK2 S1292 phosphorylation (a) and LRRK2 and RAB32 co-immunoprecipitation (b). Full-length blots are provided as Source Data.

Source data

Extended Data Table 1 Clinical characteristics for carriers of the RAB32 p.S71R variant

Full size table

Supplementary information

Supplementary Information

Supplementary Tables 1 and 2, Figs. 1 and 2, and Note.

Reporting Summary

Supplementary Data 1

Summary statistics for all gene-based association analyses.

Supplementary Data 2

Summary statistics for all single variant analyses.

Source data

Source Data Fig. 3

Uncropped western blots.

Source Data Extended Data Fig. 9

Uncropped western blots.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hop, P.J., Lai, D., Keagle, P.J. et al. Systematic rare variant analyses identify RAB32 as a susceptibility gene for familial Parkinson’s disease. Nat Genet 56, 1371–1376 (2024). https://doi.org/10.1038/s41588-024-01787-7

Download citation

Received: 06 December 2023
Accepted: 06 May 2024
Published: 10 June 2024
Issue Date: July 2024
DOI: https://doi.org/10.1038/s41588-024-01787-7

This article is cited by

Identification of Ser71Arg mutation in RAB32 gene in familial Parkinson’s disease from Southern Italy
- Monica Gagliardi
- Radha Procopio
- Andrea Quattrone
npj Parkinson's Disease (2025)
Genetic evidence for a link between frontotemporal dementia and Parkinson’s disease: the case of RAB32 Ser71Arg
- Andrea Panzavolta
- Francesco Cavallieri
- Chiara Cerami
Journal of Neurology (2025)
LRRK2 rare-variant per-domain genetic burden in Parkinson’s Disease: association confined to the kinase domain
- Sitki Cem Parlar
- Konstantin Senkevich
- Ziv Gan-Or
npj Parkinson's Disease (2025)
RAB32 variant is associated with PD susceptibility
- Heather Wood
Nature Reviews Neurology (2024)
Parkinson’s families project: a UK-wide study of early onset and familial Parkinson’s disease
- Clodagh Towns
- Zih-Hua Fang
- Raquel Real
npj Parkinson's Disease (2024)

Subjects

Abstract

Similar content being viewed by others

Main

Methods

Cohorts

Sequencing

Variant annotation

Sample quality control

Variant quality control

Single-variant analyses

Gene-based analyses

Sanger sequencing

Immunoprecipitation

Immortalized cell culture

Plasmids used for this study

Immunoprecipitation and western blot

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Project MinE ALS Sequencing Consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links