﻿<?xml version="1.0" encoding="UTF-8"?>
<ArticleSet>
  <Article>
    <Journal>
      <PublisherName>Tabriz University of Medical Sciences</PublisherName>
      <JournalTitle>Health Promotion Perspectives</JournalTitle>
      <Issn>2228-6497</Issn>
      <Volume>15</Volume>
      <Issue>1</Issue>
      <PubDate PubStatus="ppublish">
        <Year>2025</Year>
        <Month>05</Month>
        <DAY>06</DAY>
      </PubDate>
    </Journal>
    <ArticleTitle>Artificial intelligence survival models for identifying relevant risk factors for incident diabetes in Azar cohort population</ArticleTitle>
    <FirstPage>82</FirstPage>
    <LastPage>92</LastPage>
    <ELocationID EIdType="doi">10.34172/hpp.025.43105</ELocationID>
    <Language>EN</Language>
    <AuthorList>
      <Author>
        <FirstName>Neda</FirstName>
        <LastName>Gilani</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0002-5399-0277</Identifier>
      </Author>
      <Author>
        <FirstName>Mohammadhossein</FirstName>
        <LastName>Somi</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0002-0770-9309</Identifier>
      </Author>
      <Author>
        <FirstName>Farzaneh</FirstName>
        <LastName>Hamidi</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0002-5413-9185</Identifier>
      </Author>
      <Author>
        <FirstName>Pasqualina</FirstName>
        <LastName>Santaguida</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0001-7360-7362</Identifier>
      </Author>
      <Author>
        <FirstName>Elnaz</FirstName>
        <LastName>Faramarzi</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0003-4128-433X</Identifier>
      </Author>
      <Author>
        <FirstName>Reza</FirstName>
        <LastName>Arabi Belaghi</LastName>
        <Identifier Source="ORCID">https://orcid.org/0000-0002-6989-9267</Identifier>
      </Author>
    </AuthorList>
    <PublicationType>Journal Article</PublicationType>
    <ArticleIdList>
      <ArticleId IdType="doi">10.34172/hpp.025.43105</ArticleId>
    </ArticleIdList>
    <History>
      <PubDate PubStatus="received">
        <Year>2024</Year>
        <Month>04</Month>
        <Day>09</Day>
      </PubDate>
      <PubDate PubStatus="accepted">
        <Year>2024</Year>
        <Month>12</Month>
        <Day>01</Day>
      </PubDate>
    </History>
    <Abstract>Background: This study aimed to identify some risk factors associated with time to diabetes type II events using artificial intelligence (AI) survival models (SM) in a population cohort from East Azerbaijan, Iran. Methods: Data from Azar-Cohort spanning from 2014 to 2020 was analyzed using the random forest (RF) variable selection method along with Cox regression to identify the most relevant risk factors associated with diabetes. We then developed prediction models using RF survival analysis. Lasso-variable selection and RF variable selection were used to select the most important variables. The concordance index (C-index) was used to evaluate the concordance of the prediction models. Results: Our LASSO-Cox regression identified six factors to be significantly associated with diabetes: age, mean corpuscular hemoglobin concentration (MCHC), waist circumference (WC), body mass index (BMI), use of sleep medication, and hypertension stage 1 and stage 2. The model included all variables with a C-index of 76.3%. In contrast, the RF analysis identified 21 important variables predicting a higher probability of having diabetes. Of those, WC, MCHC, triglyceride, and age were the most important predictors of diabetes. The RF model converged after 500 trees with an out-of-bag (OOB) of 0.28 and a C-index of 79.5%. Conclusion: RF machine learning algorithms and LASSO-Cox regression analyses consistently identified WC, hypertension, and MCHC as the main risk factors for developing diabetes. The RF approach demonstrated slightly better accuracy in predicting the likelihood of diabetes at different time points.  </Abstract>
    <ObjectList>
      <Object Type="keyword">
        <Param Name="value">Cohort study</Param>
      </Object>
      <Object Type="keyword">
        <Param Name="value">Diabetes mellitus</Param>
      </Object>
      <Object Type="keyword">
        <Param Name="value">Incidence</Param>
      </Object>
      <Object Type="keyword">
        <Param Name="value">Random forest</Param>
      </Object>
      <Object Type="keyword">
        <Param Name="value">Survival analysis</Param>
      </Object>
    </ObjectList>
  </Article>
</ArticleSet>