TY - JOUR
T1 - Natural language processing in biomedicine
T2 - A unifi ed system architecture overview
AU - Doan, Son
AU - Conway, Mike
AU - Phuong, Tu Minh
AU - Ohno-Machado, Lucila
N1 - Publisher Copyright:
© Springer Science+Business Media New York 2014.
PY - 2014
Y1 - 2014
N2 - In contemporary electronic medical records much of the clinically important data—signs and symptoms,symptom severity, disease status, etc.—are not provided in structured data fi elds but rather are encoded inclinician-generated narrative text. Natural language processing (NLP) provides a means of unlocking thisimportant data source for applications in clinical decision support, quality assurance, and public health.This chapter provides an overview of representative NLP systems in biomedicine based on a unifi ed architecturalview. A general architecture in an NLP system consists of two main components: backgroundknowledge that includes biomedical knowledge resources and a framework that integrates NLP tools toprocess text. Systems differ in both components, which we review briefl y. Additionally, the challenge facingcurrent research efforts in biomedical NLP includes the paucity of large, publicly available annotated corpora,although initiatives that facilitate data sharing, system evaluation, and collaborative work betweenresearchers in clinical NLP are starting to emerge.
AB - In contemporary electronic medical records much of the clinically important data—signs and symptoms,symptom severity, disease status, etc.—are not provided in structured data fi elds but rather are encoded inclinician-generated narrative text. Natural language processing (NLP) provides a means of unlocking thisimportant data source for applications in clinical decision support, quality assurance, and public health.This chapter provides an overview of representative NLP systems in biomedicine based on a unifi ed architecturalview. A general architecture in an NLP system consists of two main components: backgroundknowledge that includes biomedical knowledge resources and a framework that integrates NLP tools toprocess text. Systems differ in both components, which we review briefl y. Additionally, the challenge facingcurrent research efforts in biomedical NLP includes the paucity of large, publicly available annotated corpora,although initiatives that facilitate data sharing, system evaluation, and collaborative work betweenresearchers in clinical NLP are starting to emerge.
KW - Biomedicine
KW - Electronic medical record
KW - Machine learning method
KW - Natural language processing
KW - Rule-based learning method
KW - System architecture
KW - Unified Medical Language System
UR - http://www.scopus.com/inward/record.url?scp=84921899831&partnerID=8YFLogxK
U2 - 10.1007/978-1-4939-0847-9_16
DO - 10.1007/978-1-4939-0847-9_16
M3 - Article
C2 - 24870142
AN - SCOPUS:84921899831
SN - 1064-3745
VL - 1168
SP - 275
EP - 294
JO - Methods in Molecular Biology
JF - Methods in Molecular Biology
ER -