TY - GEN
T1 - Temporal modification of apriori to find seasonal variations between symptoms and diagnoses
AU - Shrestha, Aashara
AU - Fegaras, Leonidas
AU - Zikos, Dimitrios
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/6/26
Y1 - 2018/6/26
N2 - Medical data can be mined for patterns, which may be used to predict candidate diagnoses according to symptoms and other parameters of care. Our hypothesis is that the admission (initial) patient assessment, when combined with seasonal information can provide more accurate insights for the patient diagnosis. For instance, when cough is the symptom, the probability for flu could be higher during the winter (flu season). We hereby present a method to estimate the temporal variation of the probability for a diagnosis, when the initial patient assessment is known. In order to develop the model, we utilized a large synthetic medical claims dataset from the Centers for Medicare and Medicaid Services. We used the Apriori algorithm to calculate the support and confidence for each 'admission_diagnosis → final_diagnosis' itemset. For each itemset, 52 rules were generated, one for each week of a calendar year. The Apriori output was filtered so that only itemsets with the 'admission diagnosis' on the Left Hand Side(LHS) are extracted. We furthermore smoothened, using the Exponentially Weighted Moving Average (EWMA) algorithm, and then visualized the week-by-week variability of confidence, for any 'admission_diagnosis → final_diagnosis' pair of interest. With our approach, researchers can observe seasonal variations of the diagnosis element, and further study these variations for causal knowledge discovery.
AB - Medical data can be mined for patterns, which may be used to predict candidate diagnoses according to symptoms and other parameters of care. Our hypothesis is that the admission (initial) patient assessment, when combined with seasonal information can provide more accurate insights for the patient diagnosis. For instance, when cough is the symptom, the probability for flu could be higher during the winter (flu season). We hereby present a method to estimate the temporal variation of the probability for a diagnosis, when the initial patient assessment is known. In order to develop the model, we utilized a large synthetic medical claims dataset from the Centers for Medicare and Medicaid Services. We used the Apriori algorithm to calculate the support and confidence for each 'admission_diagnosis → final_diagnosis' itemset. For each itemset, 52 rules were generated, one for each week of a calendar year. The Apriori output was filtered so that only itemsets with the 'admission diagnosis' on the Left Hand Side(LHS) are extracted. We furthermore smoothened, using the Exponentially Weighted Moving Average (EWMA) algorithm, and then visualized the week-by-week variability of confidence, for any 'admission_diagnosis → final_diagnosis' pair of interest. With our approach, researchers can observe seasonal variations of the diagnosis element, and further study these variations for causal knowledge discovery.
KW - Apriori
KW - Association rule mining
KW - Clinical decision making
KW - Health informatics
KW - Seasonal variations
UR - http://www.scopus.com/inward/record.url?scp=85049892917&partnerID=8YFLogxK
U2 - 10.1145/3197768.3201562
DO - 10.1145/3197768.3201562
M3 - Conference contribution
AN - SCOPUS:85049892917
T3 - ACM International Conference Proceeding Series
SP - 490
EP - 494
BT - Proceedings of the 11th PErvasive Technologies Related to Assistive Environments Conference, PETRA 2018
PB - Association for Computing Machinery
T2 - 11th ACM International Conference on PErvasive Technologies Related to Assistive Environments, PETRA 2018
Y2 - 26 June 2018 through 29 June 2018
ER -