Welcome to the PUFA Canalization web application!

The PUFA Canalization web application brings you a tool for exploring the intricate connections between polyunsaturated fatty acid (PUFA) metabolites and diseases.

We delve into four specific realms of elucidating PUFA metabolite-disease associations:

Disease-Metabolite Level Associations
Disease ~ Z-score Metabolite + age + sex + age^2 + PC1 + ... + PC10

Disease-Metabolite PRS Associations
Disease ~ PRS Metabolite + age + sex + age^2 + PC1 + ... + PC10

Mendelian Randomization supported metabolite-disease associations
Metabolite (exposure) ~ Disease (outcome)

Canalization of Metabolite PRS x Obesity on disease prevalence
Disease ~ Metabolite PRS x Obesity (Defined by BMI and WHR)

Background

Here, we demonstrate an integrative PheWAS approach to establish high-confidence, suggestive metabolite-disease associations for three fatty acid metabolites, omega-3 fatty acids, omega-6 fatty acids, and docosahexaenoic acid for 1,326 disease endpoints. Suggestive metabolite-disease associations were established if there was concordant direction of effect and significance for metabolite level and genetic risk score for the metabolite.

Methods

[1] Cohort design

Participants were ascertained from the UK Biobank (UKB), a cohort study of approximately 500,000 individuals from across the United Kingdom and including multiple ancestries, although predominantly European ancestry. For this analysis, previously imputed genotype and phenotype data were utilized. 409,148 individuals after sample QC were considered. There are up to 121,643 individuals in the UKB with reported metabolite levels. Of this we extracted only individuals passing the sample QC, which resulted in a total sample size of 101,793. Metabolite levels were extracted for ω3FA (field ID: 23444), ω6FA (field ID: 23445) and DHA (field ID: 23450). For individuals with more than one reporting of the metabolite level, the median of the levels was taken. ICD-10 summary diagnosis codes were extracted from UKBB to create case and control cohorts using the case inclusion and control exclusion criteria for each disease-specific phenotype code (phecode) using the phecode mappings (https://phewascatalog.org). There was a total of 1,755 disease end points to assess; however, only diseases with at least 50 cases were analyzed, which resulted in evaluation of 1,326 diseases.

[2] Disease-metabolite associations

A metabolite Z-score for the metabolite level was evaluated using the following formula: z=(x-μ)/σ, where x is the median metabolite level, μ is the mean of the median metabolite levels and σ is the standard deviation of the median metabolite level. Logistic regression models were implemented to assess the association between each Z-scoreMetabolite and disease phecode (case vs. control). Age, sex, age2 and the first ten global principal components were included in the model as covariates. Model structure: Disease status ~ Zscore-Metabolite + age + sex + age^2 + PC1 + ⋯ + PC10

[3] Disease-metabolite PRS associations

The Bayesian approach to PRS calculation, PRScs, was used, estimating posterior SNP effect sizes under continuous shrinkage (CS) priors for each GWAS summary statistic using the UK BioBank European LD reference panel (https://www.dropbox.com/s/t9opx2ty6ucrpib/ldblk_ukbb_eur.tar.gz). The inferred posterior effect sizes were then used to generate the PRS across chromosomes using PLINK’s score function and then summed for each individual. Logistic regression models were performed to assess the association between each scaled PGSMetabolite and disease phecode (case vs. control). Age, sex, age2 and the first ten global principal components were included in the model as covariates. Model structure: Disease status ~ PGS-Metabolite + age + sex + age^2 + PC1 + ⋯ + PC10

[4] Metabolite ~ disease Mendelian randomization associations

There were 178 unique significant diseases across the three fatty acid metabolites. Each of the 178 diseases was manually mapped by its name to a well-powered GWAS summary statistic available in Open GWAS. Two-sample Mendelian randomization was performed to assess the causal association between the three metabolites as exposures and the 178 diseases as outcomes. A nominally significant p-value threshold of 0.05 was applied using the Inverse Variance Weighted (IVW) MR method to determine significant associations. Because the suggestive fatty acid metabolite-disease associations were all protective, we only assessed MR associations with OR < 1. The OR for the suggestive and causal associations for each metabolite were then compared.

[5] Metabolite PRS x obesity associations

A set of disease-metabolite associations identified by the aforementioned interaction models assessing PUFA metabolites and weight measurements on disease, were further used to assess the potential events of canalization of the body weight measurements on PUFA metabolites and the impact of it on disease. Specifically, type 2 diabetes and ω3 fatty acids, obstructive chronic bronchitis and ω3FA, type 2 diabetes and DHA, chronic obstructive bronchitis and DHA, and diaphragmatic hernia and DHA. The cohort was divided into two cohorts for each body weight measurement, BMI and WHR defining obese and non-obese by standard cutoffs. The BMI and WHR obese and non-obese groupings were the same as in the previous section. The prevalence for the disease was compute for 100 bins of the PRSMetabolite for the obese and non-obese body weight measurement cohorts. To quantify the canalization, we computed delta observed, which is the difference between the right and left tail differences of the body weight obese and non-obese groups as further described in Nagpal et.al. We also computed delta expected, which is again the difference between the right and left tail differences but simulating the disease prevalence for 10 iterations, which was used in compute delta departure, a scaled estimate of the departure between observed delta and expected delta. More information on this approach is explained in Nagpal et.al.

[6] Assessing significance and concordance

A multiple testing p-value threshold of 6.28×10-6 (0.05/(1,326*6)) was applied for assessing the significance of the metabolite-disease non-genetic and genetic associations. Only the significant associations for non-genetic and genetic associations are demonstrated on this web app. A significant Mendelian randomization association was identified if there was a p < 0.05 (nominal significance) for the Inverse Variance Weighted (IVW) method. The IVW results are demonstrated on this website in the Mendelian randomization section. For the Metabolite PRS x obesity associations, only the metabolite x obesity associations are portrayed for the diseases with suggestive support (having significant p-value and concordant OR for metabolite level and metabolite PRS associations) for at least one of the three metabolites.

Disease-metabolite level association results

Disease-metabolite PRS association results

Metabolite ~ Disease Mendelian randomization association results

Metabolite PRS x obesity association results

Have questions about the PUFA Canalization web app?

Please email:

Dr. Courtney Astore or Professor Greg Gibson