Kidney GenePrioritiSation - KidneyGPS

This platform summarizes information on each of 5906 genes overlapping the 424 loci identified for eGFRcrea based on a GWAS meta-analysis of UK Biobank data and CKDGen consortium data (n=1,201,909) [Stanzick et al. Nat. Commun. 2021] . Focussed on European ancestry (n=1,004,040) , 634 independent signals across the 424 loci were identified using approximate conditioned analysis. For each variant in these 634 signals, the posterior probability of association (PPA) was computed and, for each signal, a 99% credible set of variants was derived (i.e. smallest set of variants with >99% cumultative PPA). Credible (set) variants are considered the most likely variants to drive the association signal (particularly those with high PPA).

Gene Search

Search for genes overlapping any of the 424 loci:

Specification of functional evidence for the searched gene (s) displayed in GPS table:


Specification of additional information


Restrict results to credible variants with a minimum PPA* of:

*PPA: posterior probability of the eGFRcrea association (max. PPA 1.0)


          

GPS Table



Protein-relevant variants with predicted deleteriousness:



eQTLs in glomerular tissue (NEPTUNE, or Sheng et al [Nat.Genet. 2021]):



eQTLs in tubulo-interstitial tissue (NEPTUNE, or Sheng et al [Nat.Genet. 2021]):



eQTLs in kidney-cortex tissue (GTEx):



sQTLs in kidney-cortex tissue (GTEx):



eQTLs in other tissues (GTEx):



sQTLs in other tissues (GTEx):



Kidney phenotypes in mouse:



Kidney phenotypes in human:



Summary of loci and signals containing the searched gene(s):



List of all credible variants in the locus containing the searched gene:

Shown are results from GWAS meta-analyses in European ancestry.



Locus Zoom Plot:

SNP Search

Search for SNPs which are associated with log(eGFRcrea) at P<5E-8 (all ancestries, unconditioned, n=1,201,909) or for SNPs being a credible variant for any of the 634 signals (European ancestry, conditioned, n=1,004,040):

Search options:

eGFRcrea Association (P<5E-8 in all-ancestry GWAS meta-analyis, unconditioned):


Credible variant(s):


Protein-relevance and predicted deleteriousness:


eQTL in glomerular tissue (NEPTUNE, or Sheng et al [Nat.Genet. 2021]):


eQTL in tubulo-interstitial tissue (NEPTUNE, or Sheng et al [Nat.Genet. 2021]):


eQTL in kidney-cortex tissue (GTEx):


sQTL in kidney-cortex tissue (GTEx):


eQTL in other tissues (GTEx):


sQTL in other tissues (GTEx):


Locus Zoom Plot:

Gene Prioritisation - Overview




Region search

Release history

KidneyGPS 1.2.0 [2022-10] - integration of ADTKD genes, DM-status interaction and eGFRcrea decline association; loosening of the CADD-Phred cutoff for protein-altering variants with a clear functional cosequence

KidneyGPS 1.1.2 [2022-06] - small layout changes

KidneyGPS 1.1.1 [2022-04] - minor fixes

KidneyGPS 1.1.0 [2022-03]- integration of eQTL data from Susztaklab

first publication KidneyGPS 1.0 [2022-03]


Data Sources:


Association with eGFRcrea:

Genetic loci and genes within these loci are based on a GWAS meta-analysis for eGFRcrea of UK Biobank data and CKDGen consortium data (n=1,201,909). Detailed information on the selection process can be found here. A GWAS-meta-analysis restricted to individuals of European-ancestry (n=1,004,040) was used to identify independent association signals and to calculate posterior probabilities of association (PPA) for all variants in each signal. The 99% credible variant set of variants in each signal contains with a 99% % probability the causal variant, under the assumption that there is one causal variant per association signal and that this variant is included in the analysis.


Association with other phenotypes:

eGFRcys & BUN

GWAS meta-analyses were also performed for eGFR estimated from serum cystatin C (eGFRcys, n=460,826) and blood urea nitrogen (BUN, n=852,678). KidneyGPS provides the information if the locus lead variant (variant with the smallest association p-value in a locus) is nominal significantly associated with eGFRcys or BUN with concordant effect directions. Summary statistics of these analyses can be downloaded here.

Interaction with diabetes status

Diabetes mellitus (DM) is a risk factor for kidney failure. A GWAS meta-analysis for eGFRcrea conducted separatly for individuals with or without DM ( nDM =178,691, nnoDM =1,296,113) by Winkler et al. identified 7 loci with significant DM/noDM difference. 5 of these locis showed a more pronounced effect on eGFR in DM versus noDM (DM>NoDM), one locus had a DM-only effect and one locus a noDM-only effect. Further information on the impact of diabetes status on the genetic eGFRcrea effect sizes can be found in the original publication: Winkler et al. Commun. Biol. 2022 . Variants identified by this study were mapped to eGFRcrea signals in KidneyGPS via overlap, or strong correlation with the signal index variant identified by Stanzick et al.

Association with eGFRcrea decline

Progressive eGFR-decline can lead to kidney failure, necessitating dialysis or transplantation. Hence, Gorski et al. [Kidney Int. 2022] searched for genetic association with annual eGFR-decline using 62 longitudinal studies in 343,339 individuals. Associated variants were identified by three approaches: First, a genome-wide screen on eGFR-decline unadjusted for eGFR-baseline revealed two significantly ( (Pdecline < 5 x 10-8 ) associated variants within the UMOD-PDILT locus.Second, a candidate approach among the 263 lead variants for eGFRcrea from Wuttke et al. [Nat. Genet. 2019] identified two associated variants (Bonferroni corrected: Pdecline < 0.05/263 = 1.90 x 10-4 ). Third, a genome-wide screen for association with eGFR-decline adjusted for eGFRcrea at baseline revealed five variants, that were also associated (Bonferroni corrected: Pdecline < 0.05/12 = 4.17 x 10-3 ) with eGFRcrea decline unadjusted. The identified C15orf54 signal maps to a second signal in this locus and is thus not included in our GPS. We integrated these identified variants, when they resided in eGFRcrea signal or showed strong correlation with the signal index variant identified by Stanzick et al.


CADD:

The combined annotation dependend depletion (CADD) score is a measurement of the deleteriousness of a genetic variant. By integrating multiple annotations, it contrasts variants that survived natural selection with simulated mutations. CADD evaluated ~8.6 billion SNPs and the CADD-Phred Score used on this website represents the rank of variant compared to all annotated variants. Variants with the coding and non-coding consequences "stop-gained", "stop-lost", "missense", "canonical splice", "noncoding change", "synonymous" or "splice-site" are not restricted regarding their CADD-Phred Score. Variants with "other" consequences are filtered for a CADD-Phred Score 15, which restricts our analysis to the 3.2% most deleterious variants. Further, the analysis is restricted to variants within the affected gene as overlap with eQTLs and sQTLs should be minimized to avoid overscoring particular genes and variants. For additional information regarding CADD, please vistit the CADD website.

Used version: v1.6 [2020-03-23]


eQTL and sQTL data:

All credible variants were searched in expression quantitative trait loci (eQTL) databases. Three sources for eQTL data were used:

NEPTUNE

eQTL data from the NEPTUNE study includes cis- eQTLs, which are variants that influence expression of genes within a 1Mb region centred around the variant. The association between a variant and the expression of a gene was deemed to be significant if the false dicovery rate (FDR) was <0.05. This eQTL data was obtained from glomerular and tubulo-interstitial tissue. Further information about the NEPTUNE study can be found on the webpage of the study and on the NephQTL browser.

Version from [2017-09-25]

Susztaklab (Sheng et al)

The Susztaklab also provides comprehensive kidney omics data. We integrated the eQTL data from glomerular und tubulo-interstitial tissue published by Sheng et al. (Sheng, X. et al., Nature Genetics, 2021).

GTEx

In contrast to the other two eQTL sources, the GTEx project is not restricted to kidney tissue. Furthermore, additional splicing altering variants (sQTLs) were investigated. Thus, the here integrated GTEx data includes cis- eQTL and -sQTL information from 48 different tissues with a mapping window of 1Mb up- and downstream of the transcription start site. Further information about GTEx can be found here.

Used version: GTEx Release v7 [2017-09-05]


Mouse phenotypes:

Information on genes with kidney-relevant phenotypes in mice origin from the Mouse Genome Informatics database (MGI). This includes all phenotypes subordinate to "abnormal kidney morphology" (MP:0002135) and "abnormal kidney physiology" (MP:0002136). Further information how this data was collected can be found on the MGI webpage.

Version from [2020-06-03]


Human phenotypes:

We used three sources to identify genes causing genetic disorders with kidney phenotype in human:

OMIM

The Online Mendelian Inheritance in Man (OMIM) database was queried for phenotype entries subordinate to the clinical synopsis class "kidney". Diseases with "kidney"-phenotype entries being: "normal kidneys", "normal renal ultrasound at ages 4 and 7 (in two family)", "no kidney disease", "no renal disease; normal renal function", "normal renal function; no kidney disease" and "no renal findings" were manually excluded. Be aware that OMIM entries missing a clinical synopsis entry are not included in kidneyGPS regardless of a potential kidney involvement. Further information on the diseases can be found at the OMIM webpage.

Version from [2020-08-07]

Groopman et al.

A list of 625 genes associated with Mendelian forms of kidney and genitourinary disease was published by Groopman et al. in 2019 in the New England Journal of Medicine. The original article "Diagnostic Utility of Exome Sequencing for Kidney Disease" can be found here. Please notice that not all 625 genes are included in any eGFRcrea locus and thus cannot be found in kidneyGPS.

Wopperer al.

Autosomal Dominant Tubulointerstitial Kidney Disease (ADTKD) is a heriditary kidney-disease normaly caused by mutations in at least one of five genes ( UMOD, MUC1, REN, HNF1B, SEC61A1 ) and leads to kidney failure in midadulthood. However, Wopperer et al. identified 27 putative novel ADTKD genes, of which 9 are located within an eGFRcrea associated locus. Disease type of known ADTKD genes is stated as "confirmed ADTKD" in the "Kidney phenotypes in human" section and as "putative ADTKD" for the novel genes. The original publication can be found here.


Citation

Did you use KidneyGPS for a publication? We would appreciate if you cite us: "KidneyGPS: an easily accessible web application to prioritize kidney function genes and variants based on evidence from genome-wide association studies" and the original data sources appropriately.


Contact

If you have any questions not answered by the "Documentation & Help" section, please contact: Kira Stanzick (kira-julia.stanzick@klinik.uni-regensburg.de)