lancet-header

Preprints with The Lancet is part of SSRN´s First Look, a place where journals identify content of interest prior to publication. Authors have opted in at submission to The Lancet family of journals to post their preprints on Preprints with The Lancet. The usual SSRN checks and a Lancet-specific check for appropriateness and transparency have been applied. Preprints available here are not Lancet publications or necessarily under review with a Lancet journal. These preprints are early stage research papers that have not been peer-reviewed. The findings should not be used for clinical or public health decision making and should not be presented to a lay audience without highlighting that they are preliminary and have not been peer-reviewed. For more information on this collaboration, see the comments published in The Lancet about the trial period, and our decision to make this a permanent offering, or visit The Lancet´s FAQ page, and for any feedback please contact preprints@lancet.com.

Predicting the Diagnosis of HIV and Sexually Transmitted Infections Among Men Who Have Sex with Men Using Different Machine Learning Approaches

24 Pages Posted: 8 Apr 2020

See all articles by Yining Bao

Yining Bao

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Center for Infectious Diseases

Nicholas A. Medland

Alfred Health - Melbourne Sexual Health Centre

Christopher K. Fairley

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Centre for Infectious Diseases; Alfred Health - Melbourne Sexual Health Centre; Harvard University - Department of Immunology and Infectious Diseases

Jinrong Wu

Royal Victorian Eye and Ear Hospital - Centre for Eye Research Australia

Xianwen Shang

Royal Victorian Eye and Ear Hospital - Centre for Eye Research Australia

Eric P. F. Chow

Alfred Health - Melbourne Sexual Health Centre; Monash University - Central Clinical School

Xianglong Xu

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Centre for Infectious Diseases

Zongyuan Ge

Monash University

Xun Zhuang

Nantong University

Lei Zhang

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Center for Infectious Diseases

More...

Abstract

Background: We aimed to develop models and evaluate their performance of machine learning approaches in predicting the diagnosis of HIV and sexually transmitted infections (STIs) based on a large retrospective cohort of Australian men who have sex with men (MSM).

Methods: We collected demographic, clinical, behavioural and laboratory information from the clinic records of 21273 MSM who attended Melbourne Sexual Health Centre (MSHC), Australia between 2011-2017. We limited the analysis of three STIs (syphilis, gonorrhoea, chlamydia) to the period of January 2015 to December 2017. We compared the accuracy for predicting the diagnosis of HIV and three STIs using four machine learning approaches against a multivariable logistic regression (MLR) model.

Findings: HIV was diagnosed in 436/18505 MSM (436 diagnoses/58121 consultations), syphilis in 741/13820 MSM (810 diagnoses/38490 consultations), gonorrhoea in 3258/10802 MSM (4382 diagnoses/25011 consultations), and chlamydia in 2836/7708 MSM (3918 diagnoses/13926 consultations). Machine learning approaches more accurately predicted each infection than MLR. Gradient boosting machine (GBM) was the most accurate and achieved the highest area under the receiver operator characteristic curve for HIV (76·3%) and STIs (syphilis, 85·8%; gonorrhoea, 75·5%; chlamydia, 68·0%), followed by extreme gradient boosting (71·1%, 82·2%, 70·3%, 66·4%), random forest (72·0%, 81·9%, 67·2%, 64·3%), deep learning (75·8%, 81·0%, 67·5%, 65·4%), and MLR (69·8%, 80·1%, 67·2%, 63·2%). The trained GBM models demonstrated that the ten greatest predictors collectively explained 62·7-73·6% of variations in predicting the diagnosis of HIV/STIs. Among which, STIs symptoms, past syphilis infection, age, time living in Australia, frequency of condom use with casual male sexual partners during receptive anal sex and the number of casual male sexual partners in the past 12 months were predictors most commonly identified by the models.

Interpretation: Machine learning approaches are advantageous over multivariable logistic regression models in predicting the diagnosis of HIV/STIs.

Funding Statement: Australian NHMRC Leadership Investigator Grant (GNT1172900)

Declaration of Interests: LZ is supported by the National Natural Science Foundation of China (8191101420); Thousand Talents Plan Professorship for Young Scholars (3111500001); Xi'an Jiaotong University Young Talent Support Program; Xi’an Jiaotong University Basic Research and Profession Grant (xtr022019003). CKF is supported by an Australian NHMRC Leadership Investigator Grant (GNT1172900). EPFC is supported by an Australian National Health and Medical Research Council (NHMRC) Emerging Leadership Investigator Grant (GNT1172873). XZ is supported by National Science and Technology Major Project of China (2018ZX10721102); The key Project of Philosophy and Social Sciences Research in Jiangsu Education Department of China (2018SJZDI123); Nantong Municipal Bureau of Science and Technology, China (MS12018001, HS2016002 ). All other authors declare no competing interests.

Ethics Approval Statement: The datasets were completely de-identified and not re-identifiable. Ethical approval was granted by the Alfred Hospital Ethics Committee, Australia (project number: 124/18).

Keywords: Machine learning, Diagnosis prediction, HIV, Sexually transmitted infections

Suggested Citation

Bao, Yining and Medland, Nicholas A. and Fairley, Christopher K. and Wu, Jinrong and Shang, Xianwen and Chow, Eric P. F. and Xu, Xianglong and Ge, Zongyuan and Zhuang, Xun and Zhang, Lei, Predicting the Diagnosis of HIV and Sexually Transmitted Infections Among Men Who Have Sex with Men Using Different Machine Learning Approaches (3/3/2020). Available at SSRN: https://ssrn.com/abstract=3550064 or http://dx.doi.org/10.2139/ssrn.3550064

Yining Bao (Contact Author)

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Center for Infectious Diseases

Xi’an
China

Nicholas A. Medland

Alfred Health - Melbourne Sexual Health Centre

Melbourne
Australia

Christopher K. Fairley

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Centre for Infectious Diseases

Xi'an
China

Alfred Health - Melbourne Sexual Health Centre

Melbourne
Australia

Harvard University - Department of Immunology and Infectious Diseases

Cambridge, MA
United States

Jinrong Wu

Royal Victorian Eye and Ear Hospital - Centre for Eye Research Australia

Melbourne, Victoria
Australia

Xianwen Shang

Royal Victorian Eye and Ear Hospital - Centre for Eye Research Australia

Melbourne, Victoria
Australia

Eric P. F. Chow

Alfred Health - Melbourne Sexual Health Centre

Melbourne
Australia

Monash University - Central Clinical School

Clayton
Australia

Xianglong Xu

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Centre for Infectious Diseases

Xi'an
China

Zongyuan Ge

Monash University

23 Innovation Walk
Wellington Road
Clayton, Victoria 3800
Australia

Xun Zhuang

Nantong University

40 Qingnian E Rd
Chongchuan Qu, Nantong Shi
Jiangsu Sheng, Jiangsu 226000
China

Lei Zhang

Xi'an Jiaotong University (XJTU) - China-Australia Joint Research Center for Infectious Diseases ( email )

Xi’an
China