Early metabolic markers identify potential targets for the prevention of type 2 diabetes

Peddinti, Gopal; Cobb, Jeff; Yengo, Loic; Froguel, Philippe; Kravić, Jasmina; Balkau, Beverley; Tuomi, Tiinamaija; Aittokallio, Tero; Groop, Leif

doi:10.1007/s00125-017-4325-0

Early metabolic markers identify potential targets for the prevention of type 2 diabetes

Article
Open access
Published: 08 June 2017

Volume 60, pages 1740–1750, (2017)
Cite this article

Download PDF

You have full access to this open access article

Diabetologia Aims and scope Submit manuscript

Early metabolic markers identify potential targets for the prevention of type 2 diabetes

Download PDF

Gopal Peddinti ORCID: orcid.org/0000-0002-8767-968X¹^nAff2,
Jeff Cobb³,
Loic Yengo^4,5,6,7,
Philippe Froguel^4,5,6,8,
Jasmina Kravić⁹,
Beverley Balkau¹⁰,
Tiinamaija Tuomi^1,11,12,
Tero Aittokallio^1,13 &
…
Leif Groop^1,9

8828 Accesses
88 Citations
17 Altmetric
Explore all metrics

Abstract

Aims/hypothesis

The aims of this study were to evaluate systematically the predictive power of comprehensive metabolomics profiles in predicting the future risk of type 2 diabetes, and to identify a panel of the most predictive metabolic markers.

Methods

We applied an unbiased systems medicine approach to mine metabolite combinations that provide added value in predicting the future incidence of type 2 diabetes beyond known risk factors. We performed mass spectrometry-based targeted, as well as global untargeted, metabolomics, measuring a total of 568 metabolites, in a Finnish cohort of 543 non-diabetic individuals from the Botnia Prospective Study, which included 146 individuals who progressed to type 2 diabetes by the end of a 10 year follow-up period. Multivariate logistic regression was used to assess statistical associations, and regularised least-squares modelling was used to perform machine learning-based risk classification and marker selection. The predictive performance of the machine learning models and marker panels was evaluated using repeated nested cross-validation, and replicated in an independent French cohort of 1044 individuals including 231 participants who progressed to type 2 diabetes during a 9 year follow-up period in the DESIR (Data from an Epidemiological Study on the Insulin Resistance Syndrome) study.

Results

Nine metabolites were negatively associated (potentially protective) and 25 were positively associated with progression to type 2 diabetes. Machine learning models based on the entire metabolome predicted progression to type 2 diabetes (area under the receiver operating characteristic curve, AUC = 0.77) significantly better than the reference model based on clinical risk factors alone (AUC = 0.68; DeLong’s p = 0.0009). The panel of metabolic markers selected by the machine learning-based feature selection also significantly improved the predictive performance over the reference model (AUC = 0.78; p = 0.00019; integrated discrimination improvement, IDI = 66.7%). This approach identified novel predictive biomarkers, such as α-tocopherol, bradykinin hydroxyproline, X-12063 and X-13435, which showed added value in predicting progression to type 2 diabetes when combined with known biomarkers such as glucose, mannose and α-hydroxybutyrate and routinely used clinical risk factors.

Conclusions/interpretation

This study provides a panel of novel metabolic markers for future efforts aimed at the prevention of type 2 diabetes.

Metabolomics insights into early type 2 diabetes pathogenesis and detection in individuals with normal fasting glucose

Article 06 April 2018

Metabolomics based markers predict type 2 diabetes in a 14-year follow-up study

Article Open access 28 July 2017

Metabolomics for Improved Understanding and Prediction of Cardiometabolic Diseases—Recent Findings from Human Studies

Article 27 October 2015

Introduction

Type 2 diabetes is a major disease that affects more than 420 million individuals worldwide; if current trends continue, the number will surpass 700 million individuals by 2025 [1]. Predictive biomarkers are needed to allow physicians to identify and monitor individuals at high risk for the disease. Metabolomic profiling of the complete set of small-molecule metabolites allows for capturing physiological and pathophysiological changes in the body [2, 3]. Metabolomics has proved to be a rich source of markers for diabetes [4,5,6,7,8,9], and improved the prediction of type 2 diabetes incidence beyond clinical and biological markers [10]. For instance, untargeted plasma metabolomics measuring 447 metabolites in a large cohort of women from the TwinsUK study found metabolites associated with hyperglycaemia and type 2 diabetes [11], revealing a large set of potential metabolic markers including amino acids, carbohydrates, lipids, xenobiotics and unknowns, and highlighted an important role for the catabolism of branched chain amino acids (BCAAs) in type 2 diabetes. Another untargeted metabolomics study measured more than 4500 metabolites in a prospective cohort of 300 individuals who developed type 2 diabetes during 6 years follow-up and 300 matched control participants, and identified several metabolic alterations in lipid metabolism and sugars [12]. A recent meta-analysis of 19 prospective and 27 cross-sectional studies revealed the association of several metabolites with the incidence of prediabetes (i.e. impaired glucose tolerance, impaired fasting glucose, insulin resistance or impaired insulin sensitivity) and type 2 diabetes, including hexoses, aromatic amino acids, phospholipids and triacylglycerols, and confirmed the key role of BCAAs and aromatic amino acids in the prediction of type 2 diabetes [9].

Wang et al measured 61 metabolites and examined whether multi-metabolite panels could jointly predict the type 2 diabetes risk in 2422 normoglycaemic individuals followed for 12 years in the Framingham Offspring Study [13]. They showed that BCAAs and aromatic amino acids were significantly associated with the future risk of type 2 diabetes, and that the combination of isoleucine, tyrosine and phenylalanine predicted the risk. Gall et al used untargeted metabolomics in a cohort of 399 non-diabetic individuals from the RISC (Relationship of Insulin Sensitivity to Cardiovascular Risk) study, and identified α-hydroxybutyrate (α-HB) as an early biomarker for insulin resistance and glucose intolerance [14]. Using the entire RISC cohort and a long-term observational cohort of at-risk individuals in the Botnia Prospective Study (BPS), Ferrannini et al performed targeted profiling of α-HB and 1-linoleoyl glycerophosphocholine (L-GPC) and showed that these markers jointly predicted insulin resistance and glucose intolerance [15]. These studies indicate that alterations in blood metabolite concentrations presage the onset of type 2 diabetes and aid in the identification of at-risk individuals by adding predictive information over standard clinical markers.

The objectives of the present data-driven study were to systematically assess the added value of comprehensive metabolomics data in predicting type 2 diabetes risk using machine learning-based predictive modelling, and to examine whether an unbiased feature-selection approach could identify novel metabolic marker combinations that improve the predictive performance over known biomarkers and clinical risk factors. We performed serum metabolomics in a prospective, follow-up study cohort of 543 non-diabetic individuals from the BPS, 146 of whom developed type 2 diabetes during a 10 year follow-up period. Predictive modelling enabled us to accurately predict the future incidence of type 2 diabetes using a novel biomarker panel.

Methods

Study population

The BPS was initiated in 1990 on the west coast of Finland to identify genetic factors contributing to type 2 diabetes, and includes a cohort of 2770 non-diabetic individuals followed for 10 years (median 7.7 years), 150 of whom developed type 2 diabetes [16]. A subpopulation of this cohort comprising 543 participants, whose fasting serum samples were available for metabolomics analyses, was used in the current study. This subpopulation thus included 146 participants who had progressed to type 2 diabetes by the end of the follow-up period, and 397 individuals randomly selected from those who did not progress to type 2 diabetes, such that sex is balanced. The ethics committee of the Helsinki University Hospital approved the study protocols. All individuals gave their informed consent to participation in the study.

Fasting serum samples collected at baseline were used for metabolomic profiling. Sex, age, BMI, fasting glucose, fasting insulin, family history of type 2 diabetes, waist circumference, systolic BP (SBP), diastolic BP (DBP), total blood cholesterol, HDL-cholesterol and triacylglycerols were recorded at baseline. The level of physical activity, use of antihypertensive medication (i.e. diuretics, beta blockers, calcium blockers, ACE inhibitors, angiotensin II receptor type 2 receptor inhibitors or other BP medication) and incidence of cardiovascular disease (CVD) during the follow-up period were also recorded. Glucose values measured using an OGTT at the end of the follow-up period were used to define whether an individual had progressed to type 2 diabetes (termed ‘progressors’) or remained non-diabetic (termed ‘non-progressors’).

For replication of our results, we used untargeted metabolomics data from fasting plasma samples of 1044 participants in the DESIR (Data from an Epidemiological Study on the Insulin Resistance Syndrome) study [10] from central western France, of whom 231 progressed to type 2 diabetes during a 9 year follow-up period (electronic supplementary material [ESM] Table 1). The ethics committee for the Protection of Subjects for Biomedical Research of Bicêtre Hospital, France, approved the study protocols. All individuals gave written informed consent to their participation in the study.

Metabolomics

Samples were prepared using a single-extraction method. Global untargeted metabolomics was performed using three platforms (ultra-HPLC [UHPLC]-MS in electrospray ionisation-positive and -negative modes, and GC-MS in electrospray ionisation-positive mode) to semi-quantitatively measure a diverse set of 542 serum metabolites. Targeted metabolomics was performed using an isotope-dilution UHPLC-MS/MS assay for the absolute quantification of 26 metabolites. Metabolites were identified by automated spectral comparison with a standard library and missing values were imputed using minimum non-missing measurement [14, 17]. See ESM Methods for further details. The targeted and untargeted metabolomics data were further standardised to zero mean and unit variance per metabolite, and combined into a single data matrix containing 568 metabolite measurements from 543 samples. The DESIR validation data were similarly acquired [10].

Statistical analysis

Fisher’s exact test was used to compare sex, family history, physical activity, use of antihypertensive medication and the presence of CVD between progressors and non-progressors, while Welch’s two-sample t test was used to compare age, BMI, fasting glucose, fasting insulin, waist size, SBP, DBP, cholesterol, HDL-cholesterol and triacylglycerols. Individuals with missing measurements for any of the clinical factors were excluded from all of the analyses involving that particular factor.

The statistical association of each metabolite with type 2 diabetes risk was assessed using logistic regression, using progression to type 2 diabetes as the binary response variable and the metabolite as the independent variable. ORs and corresponding significance levels (p value) were calculated from the logistic regression. Q values were calculated to control for the false discovery rate (FDR) [18]. Results at q < 0.05 were considered significant. For each significant metabolite, to assess whether its association with progression to type 2 diabetes was independent of the clinical risk factors, we performed multivariable logistic regression using the clinical covariates as additional variables.

Predictive modelling

We used machine learning to build predictive models for future type 2 diabetes risk and to find the most predictive biomarker combinations. In contrast to univariate statistical analysis, predictive modelling uses the joint distribution of the metabolic features to build multivariate models, while employing model regularisation to prevent model overfitting and to enable generalisation to new individuals. Regularised least-squares (RLS) regression for binary risk classification was used to build the metabolome-wide predictive model. To select a minimal set of predictive metabolites, we used an efficient greedy feature-selection approach for RLS (GreedyRLS) [19].

To carefully assess the predictive performance of the RLS models beyond training data, we designed a repeated nested stratified cross-validation approach (ESM Fig. 1) [20]. Ten folds of outer cross-validation nested over ten folds of inner cross-validation were repeated 100 times, with stratified fold splitting to balance the numbers of progressors and non-progressors across the folds. The outer cross-validation estimated the prediction performance of the model, while the inner cross-validation selected the regularisation parameter and the linear coefficients. When applying GreedyRLS, we used the regularisation parameter selected in the outer cross-validation and the entire training data to determine the selected features. Repetition of the nested cross-validation ensured that the estimated prediction performance and the selected features were not due to any single random fold-split in the outer cross-validation. We reported the union of feature sets selected in 100 repetitions as the final biomarker panel.

Receiver operating characteristic (ROC) curves were also derived based on the repeated nested cross-validation. The mean of the AUC values was calculated from 100 ROC curves and the 95% CI for the AUC was calculated as 2.5th and 97.5th percentile values. DeLong’s test for correlated ROC curves was used to assess the pairwise differences between competing models [21]. The DESIR validation data were also predicted using model parameters obtained in 100 repeats and averaged to calculate the ROC curve. The 95% CI of the validation AUC was calculated using DeLong’s method [21]. R packages ROCR [22] and pROC [23] were used for ROC curve analyses.

We used integrated discrimination improvement (IDI) to evaluate whether metabolites improved type 2 diabetes risk prediction when combined with clinical risk factors [24]. To calculate IDI, the RLS-based risk scores were converted into risk probabilities by scaling with SD and applying logit transformation. The discrimination slope (DS) of a model provides a measure of its discriminative ability, similar to AUC. DS was calculated as the difference in the mean risk probability between progressors and non-progressors. IDI measures the improvement obtained by adding new predictors and was calculated as the difference in DS between models with and without the new predictors, and expressed as percentage improvement obtained in DS [24]. IDI is equivalent to the integration of the net reclassification improvement over all cut-offs for the risk probability.

Results

We performed comprehensive metabolomics experiments and predictive modelling in 543 individuals from the BPS, including 146 progressors to type 2 diabetes during a 10 year follow-up period. The progressors and non-progressors were balanced for sex (Table 1). All individuals had normal glucose and insulin levels, but borderline-high cholesterol at the beginning of the study. Progressors were older and had higher BMI, fasting glucose and fasting insulin levels, waist size, SBP, DBP, and triacylglycerols than non-progressors, while HDL-cholesterol showed the opposite trend and total cholesterol showed no difference (Table 1).

Table 1 Clinical characteristics of individuals from the BPS used in this study, for training predictive models

Full size table

Targeted metabolomics measured 26 metabolites and untargeted metabolomics detected 542 distinct metabolites (316 identified and 226 unidentified) in the serum samples. Metabolon (Durham, NC, USA) identifiers are used to refer to the unknown metabolites (e.g. X-13435).

Individual metabolites are associated with type 2 diabetes risk

Statistical analysis with logistic regression found that nine out of 568 serum metabolites were negatively associated and 25 metabolites were positively associated with progression to type 2 diabetes, after controlling for FDR (q < 0.05). All of these metabolites were associated with progression to type 2 diabetes independent of fasting glucose levels at baseline, physical activity and the future incidence of CVD (ESM Fig. 2a, b). Sixteen of the metabolite associations were significant even after accounting for risk factors such as age, sex, BMI, family history and fasting insulin (Fig. 1, Table 2) or glucose (ESM Fig. 2a, b) level at baseline.

Table 2 Metabolites associated with progression to type 2 diabetes at FDR q < 0.05

Full size table

Reduced levels of glutamine, histidine, α-tocopherol and the (E,E)-isomer of bilirubin at baseline were associated with an increased risk of type 2 diabetes, independent of the risk factors considered. Increased levels of glutamate, α-HB, valine, isoleucine, trehalose and several unknown metabolites were associated with progression to type 2 diabetes independent of the risk factors (Fig. 1).

The entire metabolomic profile predicts future progression to type 2 diabetes

We tested how accurately the metabolome could predict progression to type 2 diabetes by applying a binary classification based on RLS regression. The entire metabolomic profile consisting of 568 serum metabolites predicted progression to type 2 diabetes with an AUC of 0.77 (Fig. 2a). The reference, clinical-only model using RLS regression with only the clinical risk factors of sex, age, BMI, fasting insulin level and family history predicted type 2 diabetes with an AUC of 0.68. The difference in the predictive performance between the metabolomics-only and the clinical-only models was statistically significant (p = 0.0009, DeLong’s test). Finally, we combined the clinical risk factors and the metabolomic profile within a joint RLS predictive model. It predicted progression to type 2 diabetes with an AUC of 0.76, a similar accuracy as the metabolomics-only model (p = 0.23), but significantly better than the clinical-only model (p = 0.005). The clinical-only model resulted in a DS of 0.12. The combined model increased the DS to 0.19, resulting in an IDI of 58% (i.e. 58% improvement in DS) (Fig. 2c, d).

As the fasting glucose level at baseline is a known risk factor for type 2 diabetes, we added this into the clinical model as an additional covariate. Although adding glucose improved the clinical-only model (AUC = 0.70, DS = 0.14), the combined model showed significantly better performance (p = 0.023, IDI = 36%; ESM Fig. 3).

An additional clinical reference model that added fasting glucose, total cholesterol, HDL-cholesterol, triacylglycerols, SBP, DBP and waist circumference improved the clinical-only model (AUC = 0.71, DS = 0.15), although the combined model again remained significantly better (p = 0.04, IDI = 33%, ESM Fig. 4).

Specific metabolic biomarkers predict future progression to type 2 diabetes

To better interpret the predictive ability of the metabolome, we sought to identify the key metabolite features required for optimal prediction accuracy by applying GreedyRLS. In order to find the number of features required for optimal prediction performance, we evaluated GreedyRLS by varying the model size from one to 20 features, and thus determined that five metabolites were sufficient for the maximal AUC (ESM Fig. 5).

The five-metabolite signatures selected during 100 repetitions of GreedyRLS predicted type 2 diabetes with an average AUC of 0.75 (Fig. 2b), showing higher predictive accuracy than the reference clinical-only model (AUC = 0.68), although the difference was not significant (p = 0.18). However, combining the panel of all selected metabolite features with clinical variables led to the highest predictive performance (AUC = 0.78; see also ESM Results), showing significant improvement over the clinical-only model (p = 0.00019; DS = 0.2, IDI = 66.7%; Fig. 2c, e) as well as over the metabolite-only model (p = 0.0004). Combining the selected metabolites also significantly improved performance over the additional clinical models, namely the model that contained fasting glucose (p = 0.0016, IDI = 43%, ESM Fig. 3) and the model that contained fasting glucose, total cholesterol, HDL-cholesterol, triacylglycerols, SBP, DBP and waist size (p = 0.0025, IDI = 40%, ESM Fig. 4) as additional clinical covariates.

Among the biomarker panel, formed as the union of the metabolic predictors selected during 100 repetitions of GreedyRLS, three metabolites were associated with decreased type 2 diabetes risk: α-tocopherol, bradykinin (BK) hydroxyproline ([Hyp3]-BK) and X-13435; and four were associated with increased risk: α-HB, glucose, mannose and X-12063 (Fig. 3, Table 3). These metabolic predictors, except X-13435, were associated with progression to type 2 diabetes, independent of traditional risk factors as well as of physical activity, use of hypertension medication and future incidence of CVD (ESM Fig. 2c). Mannose showed high correlation with fasting glucose, while X-12063, [Hyp3]-BK and α-HB showed low but statistically significant correlation (ESM Table 2).

Table 3 Statistical association of multivariate predictive markers with progression to type 2 diabetes

Full size table

The metabolomics data from the DESIR study included semi-quantitative measurements of four of our seven metabolic markers, namely glucose, mannose, α-HB and α-tocopherol [10]. We used these metabolites and the clinical covariates of sex, age, BMI, family history and fasting insulin measured in the DESIR study samples to predict the progression of these individuals to type 2 diabetes (Fig. 4). The validation AUC values for the clinical-only and combined models were 0.76 and 0.84, respectively, and the selected metabolic markers significantly improved the prediction performance over the clinical-only model (p = 5.4 × 10⁻⁷; IDI = 31.6%).

Discussion

Using comprehensive metabolomics profiling, we have identified a novel multivariate panel of metabolic markers consisting of glucose, mannose, α-HB, α-tocopherol, [Hyp3]-BK, X-12063 and X-13435, whose concentrations in fasting serum samples predicted future progression to type 2 diabetes in an otherwise healthy, normoglycaemic population, years before the onset of type 2 diabetes (Fig. 3, Table 3). These metabolic markers significantly improved the prediction of progression towards type 2 diabetes, showing the added value of screening metabolites along with clinical risk factors.

Statistical association testing and machine learning-based predictive modelling identified metabolic changes that preceded type 2 diabetes. Statistical tests identified 34 significant metabolites, yet multivariate predictive models required only five metabolites for the optimal prediction of progression to type 2 diabetes. While the metabolite features identified using both approaches are well supported in type 2 diabetes literature, our novel contribution was in systematically assessing the predictive performance of the biomarker panel in type 2 diabetes risk prediction.

Statistical associations

Histidine, glutamine and the (E,E)-isomer of bilirubin were negatively associated with type 2 diabetes risk independent of clinical risk factors (ESM Fig. 2b). Histidine-mediated suppression of hepatic glucose production has previously been suggested as a potential target for the treatment of type 2 diabetes [25]. In a double-blind placebo-controlled trial in participants with type 2 diabetes, glutamine supplementation yielded positive results [26]. Bilirubin has previously been reported to be negatively correlated with progression to diabetic nephropathy in individuals with type 2 diabetes [27, 28].

Glutamate and trehalose were positively associated with type 2 diabetes risk independent of clinical risk factors (Table 2, Fig. 1, ESM Fig. 2a). Plasma glutamate levels are known to be elevated in several diseases characterised by chronic oxidative stress and inflammation. Furthermore, chronically high extracellular glutamate levels may directly or indirectly contribute to the progressive loss of beta cells in both type 1 and type 2 diabetes [29]. Although trehalose is widely regarded as a safe food ingredient even for individuals with diabetes, trehalose was associated with increased type 2 diabetes risk in this study. Similar results have been previously reported in an African-American population [30].

Predictive modelling

The machine learning model based on the entire metabolome accurately predicted the future incidence of type 2 diabetes. To derive a more interpretable model, we performed feature selection and identified the most predictive metabolic markers (Fig. 3, Table 3). Combining clinical variables with the selected markers significantly improved the predictive performance over the reference model with clinical variables alone, and led to the model with the highest predictive performance. Even after excluding glucose, the combined predictive model outperformed the clinical reference model that contained fasting glucose (p = 0.0066, ESM Fig. 6), showing that the predictive performance of the marker panel was not solely due to glucose. Excluding glucose as well as mannose from the marker panel resulted in a reasonable combined predictive model (AUC = 0.75) that outperformed the reference model (p = 0.04), further supporting the value of our novel markers.

Among the biomarkers identified using feature selection (Fig. 3, Table 3), three showed negative associations and four showed positive associations with type 2 diabetes risk. The negatively associated markers were [Hyp3]-BK, α-tocopherol and X-13435 and, to the best of our knowledge, this is the first metabolomics study revealing the value of these markers in predicting type 2 diabetes risk. The positively associated markers were glucose, mannose, α-HB and X-12063. Mechanisms by which mannose is associated with an increased risk of type 2 diabetes independently of glucose have previously been suggested [31]. Identifying the unknown markers X-13435 and X-12063, although non-trivial, may reveal potentially novel pathways associated with progression to type 2 diabetes.

BK

BK is a peptide that causes vasodilation associated with BP lowering and protection from CVD [32]. [Hyp3]-BK is a BK analogue in which the third amino acid, proline, is hydroxylated. In this study, [Hyp3]-BK was negatively associated with type 2 diabetes risk independent of CVD risk (p = 2.2 × 10⁻⁶). However, as BK was elevated in progressors, showing an opposite trend (data not shown), we additionally tested the association of the total amount of BK and [Hyp3]-BK with progression. The total BK level was also negatively associated with progression to type 2 diabetes (p = 0.004). Diabetes decreases the activity of the kallikrein–kinin system by reducing the synthesis of plasma prekallikrein and hence BK, resulting in endothelial dysfunction [33, 34]. The current study revealed reduced levels of total BK far before the onset of type 2 diabetes, indicating a potential early role for the kallikrein–kinin system or oxidative stress and DNA damage associated with reduced BK in progression to type 2 diabetes. Interestingly, physical exercise improves glucose uptake by skeletal muscle resulting in improved insulin sensitivity, an effect that is partially mediated by an increased BK concentration, suggesting a mechanism by which physical exercise would contribute to the prediction of type 2 diabetes [33, 35,36,37]. [Hyp3]-BK was associated with type 2 diabetes risk independently of physical activity, antihypertensive medication and CVD (ESM Fig. 2c).

α-Tocopherol

A reduction in α-tocopherol, the most biologically active form of vitamin E in humans, was seen in progressors, and it was selected in the biomarker panel. Although observational studies have previously indicated a protective effect of vitamin E supplementation on glycaemic control in type 2 diabetes, randomised controlled trials have not confirmed the effect [38,39,40].

The unknowns (X-13435 and X-12063)

The unknown metabolite X-12063 showed a strong association with progression towards type 2 diabetes, and was selected in the biomarker panel (Fig. 3). Although its identity is currently unknown, this metabolite has previously been noted to be significantly associated with insulin resistance and glucose intolerance [14]. X-13435, which was not identified in earlier metabolomics studies, did not show univariate statistical association, but predicted type 2 diabetes risk jointly with the other markers.

All of the metabolic markers were associated with progression to type 2 diabetes independently of fasting glucose (ESM Fig. 2c). Except for X-13435, they were all associated with progression to type 2 diabetes independently of CVD, physical activity and use of hypertension medication. While mannose was highly correlated with fasting glucose, [Hyp3]-BK and X-12063 showed statistically significant yet low correlations. Similarly, α-HB showed a relatively low correlation, although statistically significant, with fasting glucose (ESM Table 2).

Taken together, the statistical analysis and predictive modelling identified a variety of known metabolic changes associated with progression to type 2 diabetes (ESM Fig. 2). In agreement with a recent meta-analysis [9], our study identified changes in BCAAs (valine and isoleucine), amino acids (histidine, glutamine and glycine), sugars (glucose and mannose) and other metabolites (glutamate, α-HB and L-GPC).

Benchmarking of the predictive markers

According to our benchmarking results, our new biomarker panel performed better than previously published metabolic markers, namely α-HB and L-GPC [14, 15] and amino acids [13]. α-HB was associated with increased insulin resistance and glucose intolerance, whereas L-GPC was protective [14, 15]. Predictive modelling with α-HB and L-GPC revealed a high predictive performance (AUC = 0.72; ESM Fig. 7), when combined with clinical covariates.

BCAAs are associated with insulin resistance, and the combination of isoleucine and the amino acids tyrosine and phenylalanine has been reported to predict the risk of type 2 diabetes 12 years before disease onset [13]. Consistent with previous studies, joint modelling of isoleucine, tyrosine, phenylalanine and clinical covariates showed high predictive performance (AUC = 0.71; ESM Fig. 7). As these amino acids are consistently found in many studies of prediabetes (i.e. impaired glucose tolerance, impaired fasting glucose, insulin resistance or impaired insulin sensitivity) and type 2 diabetes [9], they may have high potential for routine use as predictive biomarkers, and further translational research is needed to facilitate their clinical use.

Limitations of the study

Distinct predictive signatures may be discovered in different studies due to differences in the metabolites measured, or differences in the genetic and environmental background of the study population. The relatively large overlap of our biomarker panel with known markers of type 2 diabetes, however, suggests that our results are robust and stable. Replication of our findings in the DESIR data also shows that the predictive model trained using Finnish individuals generalised to independent French individuals, despite the potential variability due to the demographic difference.

α-HB was quantitatively measured in the BPS dataset, but using an untargeted platform in the DESIR dataset. Moreover, three of the metabolic markers, [Hyp3]-BK, X-12063 and X-13435, were not measured in the DESIR study. Despite these differences, we confirmed the high predictive performance and added predictive value of the selected metabolic markers in the independent study.

Conclusions

Progressors and non-progressors have different metabolic profiles years before they develop overt type 2 diabetes. In this study, a combination of known markers such as glucose, mannose and α-HB, and novel markers such as α-tocopherol, [Hyp3]-BK, X-12063 and X-13435, was found to accurately predict progression to type 2 diabetes. Interestingly, the negative association of [Hyp3]-BK with progression to type 2 diabetes highlights a possible mechanism by which interventions such as exercise could contribute to the prevention of type 2 diabetes.

Abbreviations

BCAA:: Branched chain amino acid
BK:: Bradykinin
BPS:: Botnia Prospective Study
CVD:: Cardiovascular disease
DESIR:: Data from an Epidemiological Study on the Insulin Resistance Syndrome
DBP:: Diastolic BP
DS:: Discrimination slope
FDR:: False discovery rate
GreedyRLS:: Greedy feature selection for regularised least squares
α-HB:: α-Hydroxybutyrate
[Hyp3]-BK:: Bradykinin hydroxyproline
IDI:: Integrated discrimination improvement
L-GPC:: 1-Linoleoyl glycerophosphocholine
RISC:: Relationship of Insulin Sensitivity to Cardiovascular Risk
RLS:: Regularised least squares
ROC:: Receiver operating characteristic
SBP:: Systolic BP
UHPLC:: Ultra-HPLC

References

NCD Risk Factor Collaboration (NCD-RisC) (2016) Worldwide trends in diabetes since 1980: a pooled analysis of 751 population-based studies with 4.4 million participants. Lancet 387:1513–1530
Article Google Scholar
Wishart DS (2016) Emerging applications of metabolomics in drug discovery and precision medicine. Nat Rev Drug Discov 15:473–484
Article CAS PubMed Google Scholar
Goodacre R, Vaidyanathan S, Dunn WB, Harrigan GG, Kell DB (2004) Metabolomics by numbers: acquiring and understanding global metabolite data. Trends Biotechnol 22:245–252
Article CAS PubMed Google Scholar
Bain JR, Stevens RD, Wenner BR, Ilkayeva O, Muoio D, Newgard CB (2009) Metabolomics applied to diabetes research: moving from information to knowledge. Diabetes 58:2429–2443
Article CAS PubMed PubMed Central Google Scholar
Wang-Sattler R, Yu Z, Herder C et al (2012) Novel biomarkers for pre-diabetes identified by metabolomics. Mol Syst Biol 8:615
Article PubMed PubMed Central Google Scholar
Suhre K, Meisinger C, Döring A et al (2010) Metabolic footprint of diabetes: a multiplatform metabolomics study in an epidemiological setting. PLoS One 5:e13953
Article PubMed PubMed Central Google Scholar
Floegel A, Stefan N, Yu Z et al (2013) Identification of serum metabolites associated with risk of type 2 diabetes using a targeted metabolomic approach. Diabetes 62:639–648
Article CAS PubMed PubMed Central Google Scholar
Klein MS, Shearer J (2016) Metabolomics and type 2 diabetes: translating basic research into clinical application. J Diabetes Res 2016:3898502
Article PubMed Google Scholar
Guasch-Ferré M, Hruby A, Toledo E et al (2016) Metabolomics in prediabetes and diabetes: a systematic review and meta-analysis. Diabetes Care 39:833–846
Article PubMed PubMed Central Google Scholar
Yengo L, Arredouani A, Marre M et al (2016) Impact of statistical models on the prediction of type 2 diabetes using non-targeted metabolomics profiling. Mol Metab 5:918–925
Article CAS PubMed PubMed Central Google Scholar
Menni C, Fauman E, Erte I et al (2013) Biomarkers for type 2 diabetes and impaired fasting glucose using a nontargeted metabolomics approach. Diabetes 62:4270–4276
Article CAS PubMed PubMed Central Google Scholar
Drogan D, Dunn WB, Lin W et al (2015) Untargeted metabolic profiling identifies altered serum metabolites of type 2 diabetes mellitus in a prospective, nested case control study. Clin Chem 61:487–497
Article CAS PubMed Google Scholar
Wang TJ, Larson MG, Vasan RS et al (2011) Metabolite profiles and the risk of developing diabetes. Nat Med 17:448–453
Article PubMed PubMed Central Google Scholar
Gall WE, Beebe K, Lawton KA et al (2010) α-Hydroxybutyrate is an early biomarker of insulin resistance and glucose intolerance in a nondiabetic population. PLoS One 5:e10883
Article PubMed PubMed Central Google Scholar
Ferrannini E, Natali A, Camastra S et al (2013) Early metabolic markers of the development of dysglycaemia and type 2 diabetes and their physiological significance. Diabetes 62:1730–1737
Article CAS PubMed PubMed Central Google Scholar
Lyssenko V, Almgren P, Anevski D et al (2005) Predictors of and longitudinal changes in insulin sensitivity and secretion preceding onset of type 2 diabetes. Diabetes 54:166–174
Article CAS PubMed Google Scholar
Evans AM, DeHaven CD, Barrett T, Mitchell M, Milgram E (2009) Integrated, nontargeted ultrahigh performance liquid chromatography/electrospray ionization tandem mass spectrometry platform for the identification and relative quantification of the small-molecule complement of biological systems. Anal Chem 81:6656–6667
Article CAS PubMed Google Scholar
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol 57:289–300
Google Scholar
Pahikkala T, Okser S, Airola A, Salakoski T, Aittokallio T (2012) Wrapper-based selection of genetic features in genome-wide association studies through fast matrix operations. Algorithm Mol Biol 7:11
Article Google Scholar
Krstajic D, Buturovic LJ, Leahy DE, Thomas S (2014) Cross-validation pitfalls when selecting and assessing regression and classification models. J Cheminform 6:10
Article PubMed PubMed Central Google Scholar
DeLong ER, DeLong DM, Clarke-Pearson DL (1988) Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics 44:837–845
Article CAS PubMed Google Scholar
Sing T, Sander O, Beerenwinkel N, Lengauer T (2005) ROCR: visualizing classifier performance in R. Bioinformatics 21:3940–3941
Article CAS PubMed Google Scholar
Robin X, Turck N, Hainard A et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinforma 12:77
Article Google Scholar
Pencina MJ, D’Agostino RB, D’Agostino RB, Vasan RS (2008) Evaluating the added predictive ability of a new marker: from area under the ROC curve to reclassification and beyond. Stat Med 27:157–172
Article PubMed Google Scholar
Kimura K, Nakamura Y, Inaba Y et al (2013) Histidine augments the suppression of hepatic glucose production by central insulin action. Diabetes 62:2266–2277
Article CAS PubMed PubMed Central Google Scholar
Mansour A, Mohajeri-Tehrani MR, Qorbani M, Heshmat R, Larijani B, Hosseini S (2015) Effect of glutamine supplementation on cardiovascular risk factors in patients with type 2 diabetes. Nutrition 31:119–126
Article CAS PubMed Google Scholar
Hull TD, Agarwal A (2014) Bilirubin: a potential biomarker and therapeutic target for diabetic nephropathy. Diabetes 63:2613–2616
Article CAS PubMed PubMed Central Google Scholar
Riphagen IJ, Deetman PE, Bakker SJL et al (2014) Bilirubin and progression of nephropathy in type 2 diabetes: a post hoc analysis of RENAAL with independent replication in IDNT. Diabetes 63:2845–2853
Article CAS PubMed Google Scholar
Davalli AM, Perego C, Folli FB (2012) The potential role of glutamate in the current diabetes epidemic. Acta Diabetol 49:167–183
Article CAS PubMed Google Scholar
Yu B, Zheng Y, Alexander D, Morrison AC, Coresh J, Boerwinkle E (2014) Genetic determinants influencing human serum metabolome among African Americans. PLoS Genet 10:e1004212
Article PubMed PubMed Central Google Scholar
Lee S, Zhang C, Kilicarslan M et al (2016) Integrated network analysis reveals an association between plasma mannose levels and insulin resistance. Cell Metab 24:172–184
Article CAS PubMed Google Scholar
Palkhiwala SA, Frishman WH, Warshafsky S (2001) Bradykinin for the treatment of cardiovascular disease. Heart Dis 3:333–339
Article CAS PubMed Google Scholar
Simões HG, Asano RY, Sales MM et al (2013) Type 2 diabetes elicits lower nitric oxide, bradykinin concentration and kallikrein activity together with higher DesArg(9)-BK and reduced post-exercise hypotension compared to non-diabetic condition. PLoS One 8:e80348
Article PubMed PubMed Central Google Scholar
Stabler T, Kenjale A, Ham K, Jelesoff N, Allen J (2010) Potential mechanisms for reduced delivery of nitric oxide to peripheral tissues in diabetes mellitus. Ann N Y Acad Sci 1203:101–106
Article CAS PubMed Google Scholar
Kolodka T, Charles ML, Raghavan A et al (2014) Preclinical characterization of recombinant human tissue kallikrein-1 as a novel treatment for type 2 diabetes mellitus. PLoS One 9:e103981
Article PubMed PubMed Central Google Scholar
Asano RY, Sales MM, Browne RA et al (2014) Acute effects of physical exercise in type 2 diabetes: a review. World J Diabetes 5:659–665
Article PubMed PubMed Central Google Scholar
Taguchi T, Kishikawa H, Motoshima H et al (2000) Involvement of bradykinin in acute exercise-induced increase of glucose uptake and GLUT-4 translocation in skeletal muscle: studies in normal and diabetic humans and rats. Metab Clin Exp 49:920–930
Article CAS PubMed Google Scholar
Kataja-Tuomola MK, Kontto JP, Männistö S, Albanes D, Virtamo JR (2010) Effect of alpha-tocopherol and beta-carotene supplementation on macrovascular complications and total mortality from diabetes: results of the ATBC study. Ann Med 42:178–186
Article CAS PubMed Google Scholar
Kataja-Tuomola M, Sundell JR, Männistö S et al (2008) Effect of α-tocopherol and β-carotene supplementation on the incidence of type 2 diabetes. Diabetologia 51:47–53
Article CAS PubMed Google Scholar
Xu R, Zhang S, Tao A, Chen G, Zhang M (2014) Influence of vitamin E supplementation on glycaemic control: a meta-analysis of randomised controlled trials. PLoS One 9:e95008
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

All of the study participants are thanked for making this research possible. The skillful assistance of the Botnia Study Group is gratefully acknowledged.

Author information

Gopal Peddinti
Present address: , Tietotie 2, P. O. Box 1000, FIN-02044 VTT, Espoo, Finland

Authors and Affiliations

Institute for Molecular Medicine Finland (FIMM), Nordic EMBL Partnership for Molecular Medicine, University of Helsinki, Helsinki, Finland
Gopal Peddinti, Tiinamaija Tuomi, Tero Aittokallio & Leif Groop
Metabolon Inc., Durham, NC, USA
Jeff Cobb
CNRS UMR8199, Pasteur Institute of Lille, Lille, France
Loic Yengo & Philippe Froguel
European Genomic Institute for Diabetes (EGID), FR-3508, Lille, France
Loic Yengo & Philippe Froguel
Lille University, Lille, France
Loic Yengo & Philippe Froguel
Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
Loic Yengo
Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK
Philippe Froguel
Lund University Diabetes Center, Lund, Sweden
Jasmina Kravić & Leif Groop
CESP, Faculty of Medicine – University Paris-South; Faculty of Medicine – University Versailles-St Quentin; Inserm U1018, University Paris-Saclay, Villejuif, France
Beverley Balkau
Department of Endocrinology, Abdominal Centre, Helsinki University Central Hospital, Helsinki, Finland
Tiinamaija Tuomi
Folkhalsan Research Center and Research Programs Unit, Diabetes and Obesity, University of Helsinki, Helsinki, Finland
Tiinamaija Tuomi
Department of Mathematics and Statistics, University of Turku, Turku, Finland
Tero Aittokallio

Authors

Gopal Peddinti
View author publications
You can also search for this author in PubMed Google Scholar
Jeff Cobb
View author publications
You can also search for this author in PubMed Google Scholar
Loic Yengo
View author publications
You can also search for this author in PubMed Google Scholar
Philippe Froguel
View author publications
You can also search for this author in PubMed Google Scholar
Jasmina Kravić
View author publications
You can also search for this author in PubMed Google Scholar
Beverley Balkau
View author publications
You can also search for this author in PubMed Google Scholar
Tiinamaija Tuomi
View author publications
You can also search for this author in PubMed Google Scholar
Tero Aittokallio
View author publications
You can also search for this author in PubMed Google Scholar
Leif Groop
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gopal Peddinti.

Ethics declarations

Data availability

Data are available upon request from the authors.

Funding

This study was supported by Academy of Finland grants 265966 to GP, 269862, 272437 and 295504 to TA, and 263401 and 267882 to LG. LG was supported by European Research Council grant GA 269045. The BPS has been financially supported by grants from the Sigrid Juselius Foundation, Folkhalsan Research Foundation, Nordic Center of Excellence in Disease Genetics, European Union Framework Programme (EU FP6) project EXGENESIS, Finnish Diabetes Research Foundation, Foundation for Life and Health in Finland, Finnish Medical Society, Helsinki University Central Hospital Research Foundation, Perklén Foundation, Ollqvist Foundation and Narpes Health Care Foundation. The study has also been supported by the Municipal Health Care Center and Hospital in Jakobstad and Health Care Centers in Vaasa, Narpes and Korsholm. The research leading to the validation study was supported by funding from ANR-10-LABX-46, ANR-10-EQPX-07-01, the European Research Council GEPIDIAB – 294785 and the Qatar Foundation (PF).

Duality of interest

The authors declare that there is no duality of interest associated with this manuscript.

Contribution statement

GP designed the study, analysed data and drafted the manuscript. JC performed metabolomics and revised the manuscript. LY, PF and BB collected and analysed validation data. JK performed data processing and management. LG and TT conceived, designed and coordinated the Botnia study. TA, TT and LG critically reviewed the manuscript for important intellectual content. All authors contributed significantly to the data analysis, and reviewed and approved the final manuscript. GP and LG are the guarantors of the study.

Electronic supplementary material

ESM

(PDF 4635 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Peddinti, G., Cobb, J., Yengo, L. et al. Early metabolic markers identify potential targets for the prevention of type 2 diabetes. Diabetologia 60, 1740–1750 (2017). https://doi.org/10.1007/s00125-017-4325-0

Download citation

Received: 17 February 2017
Accepted: 11 May 2017
Published: 08 June 2017
Issue Date: September 2017
DOI: https://doi.org/10.1007/s00125-017-4325-0

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Early metabolic markers identify potential targets for the prevention of type 2 diabetes

Abstract

Aims/hypothesis

Methods

Results

Conclusions/interpretation

Similar content being viewed by others

Metabolomics insights into early type 2 diabetes pathogenesis and detection in individuals with normal fasting glucose

Metabolomics based markers predict type 2 diabetes in a 14-year follow-up study

Metabolomics for Improved Understanding and Prediction of Cardiometabolic Diseases—Recent Findings from Human Studies

Introduction

Methods

Study population

Metabolomics

Statistical analysis

Predictive modelling

Results

Individual metabolites are associated with type 2 diabetes risk

The entire metabolomic profile predicts future progression to type 2 diabetes

Specific metabolic biomarkers predict future progression to type 2 diabetes

Discussion

Statistical associations

Predictive modelling

BK

α-Tocopherol

The unknowns (X-13435 and X-12063)

Benchmarking of the predictive markers

Limitations of the study

Conclusions

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Data availability

Funding

Duality of interest

Contribution statement

Electronic supplementary material

ESM

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation