A hybrid knowledge and ensemble classification approach for prediction of venous thromboembolism

Susan Sabra, Khalid Mahmood Malik, Muhammad Afzal, Vian Sabeeh, Ahmad Charaf Eddine

Research output: Contribution to journalArticlepeer-review

5 Scopus citations


Clinical narratives such as progress summaries, lab reports, surgical reports, and other narrative texts contain key biomarkers about a patient's health. Evidence-based preventive medicine needs accurate semantic and sentiment analysis to extract and classify medical features as the input to appropriate machine learning classifiers. However, the traditional approach of using single classifiers is limited by the need for dimensionality reduction techniques, statistical feature correlation, a faster learning rate, and the lack of consideration of the semantic relations among features. Hence, extracting semantic and sentiment-based features from clinical text and combining multiple classifiers to create an ensemble intelligent system overcomes many limitations and provides a more robust prediction outcome. The selection of an appropriate approach and its interparameter dependency becomes key for the success of the ensemble method. This paper proposes a hybrid knowledge and ensemble learning framework for prediction of venous thromboembolism (VTE) diagnosis consisting of the following components: a VTE ontology, semantic extraction and sentiment assessment of risk factor framework, and an ensemble classifier. Therefore, a component-based analysis approach was adopted for evaluation using a data set of 250 clinical narratives where knowledge and ensemble achieved the following results with and without semantic extraction and sentiment assessment of risk factor, respectively: a precision of 81.8% and 62.9%, a recall of 81.8% and 57.6%, an F measure of 81.8% and 53.8%, and a receiving operating characteristic of 80.1% and 58.5% in identifying cases of VTE.

Original languageEnglish
Article numbere12388
JournalExpert Systems
Issue number1
StatePublished - Feb 1 2020


  • clinical decision support system
  • clinical text processing
  • ensemble classifier
  • semantic extraction
  • sentiment analysis
  • venous thromboembolism


Dive into the research topics of 'A hybrid knowledge and ensemble classification approach for prediction of venous thromboembolism'. Together they form a unique fingerprint.

Cite this