University of Worcester Worcester Research and Publications

Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls

Castro, V.M., Minnier, J., Murphy, S.N., Kohane, I.S., Churchill, S.E., Gainer, V., Cai, T., Hoffnagle, A.G., Dai, Y., Block, S., Weill, S.R., Nadal-Vicens, M., Pollastri, A.R., Rosenquist, J.N., Goryachev, S., Ongur, D., Sklar, P., Perlis, R.H., Smoller, J.W., Lee, P.H., Stahl, E.A., Purcell, S.M., Ruderfer, D.M., Charney, A.W., Roussos, P., Pato, C., Pato, M., Medeiros, H., Sobel, J., Craddock, N., Jones, I., Forty, L., Di Florio, A., Green, E., Jones, Lisa ORCID:, Dunjewski, K., Landén, M., Hultman, C., Juréus, A., Bergen, S., Svantesson, O., McCarroll, S., Moran, J.L., Chambert, R.A. and Belliveau, R. (2014) Validation of Electronic Health Record Phenotyping of Bipolar Disorder Cases and Controls. The American Journal of Psychiatry, 172 (4). pp. 363-372. ISSN Print: 0002-953X Online: 1535-7228

Full text not available from this repository. (Request a copy)


Objective: The study was designed to validate use of elec-tronic health records (EHRs) for diagnosing bipolar disorder and classifying control subjects.
Method: EHR data were obtained from a health care system of more than 4.6 million patients spanning more than 20 years. Experienced clinicians reviewed charts to identify text features and coded data consistent or inconsistent with a diagnosis of bipolar disorder. Natural language processing was used to train a diagnostic algorithm with 95% specificity for classifying bipolar disorder. Filtered coded data were used to derive three additional classification rules for case subjects and one for control subjects. The positive predictive value (PPV) of EHR-based bipolar disorder and subphenotype di- agnoses was calculated against diagnoses from direct semi- structured interviews of 190 patients by trained clinicians blind to EHR diagnosis.
Results: The PPV of bipolar disorder defined by natural language processing was 0.85. Coded classification based on strict filtering achieved a value of 0.79, but classifications based on less stringent criteria performed less well. No EHR- classified control subject received a diagnosis of bipolar dis- order on the basis of direct interview (PPV=1.0). For most subphenotypes, values exceeded 0.80. The EHR-based clas- sifications were used to accrue 4,500 bipolar disorder cases and 5,000 controls for genetic analyses.
Conclusions: Semiautomated mining of EHRs can be used to ascertain bipolar disorder patients and control subjects with high specificity and predictive value compared with diagnostic interviews. EHRs provide a powerful resource for high-throughput phenotyping for genetic and clinical research.

Item Type: Article
Additional Information:

The full-text can be accessed via the official URL.

Uncontrolled Discrete Keywords: Bipolar disorder, electronic health record, phenotyping
Subjects: B Philosophy. Psychology. Religion > BF Psychology
R Medicine > RC Internal medicine > RC0321 Neuroscience. Biological psychiatry. Neuropsychiatry
Divisions: College of Health, Life and Environmental Sciences > School of Allied Health and Community
Related URLs:
Depositing User: Lisa Jones
Date Deposited: 13 Oct 2016 11:12
Last Modified: 17 Jun 2020 17:14

Actions (login required)

View Item View Item
Worcester Research and Publications is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software credits.