JAMIA Publication:The Potential and Pitfalls of using a LLM as a Clinical Assistant

Predicting Length-of-Stay & Risk of Mortality for ICU Patients

Pangaea Data Awarded Top Tier Co-Sell Partnership Status by Microsoft

Pangaea Named ‘Digital Solution of the Year’ by UK Government’s Department for International Trade

How can Pharma & Healthcare Collaborate in a Financially Sustainable & Scalable Manner to Improve Outcomes?

Go back

Phenotyping in Clinical Text with Unsupervised Numerical Reasoning for Patient Stratification

This publication was selected to be presented at the AAAI 2022 conference. It has also been published in the Experimental Biology and Medicine (EBM) Journal and Multimodal AI in Healthcare Publication.

Profiling a patient using the phenotypes from clinical text has various applications in the healthcare domain, including patient stratification for hard-to-diagnose diseases. One of the key challenges in phenotyping is numerical reasoning which involves determining a phenotype based on numerical measurements. This is an open problem, and current state-of-the-art generic phenotyping methods perform poorly. Our study addresses this issue by presenting a novel unsupervised methodology that substantially outperforms the generic phenotyping methods. Our methodology is helpful for our research community as it requires no supervision, saving a lot of cost and time, as well as it can be generalized to other biomedical tasks requiring numerical reasoning. We demonstrate the impact of the work by building patient stratification systems for several diseases by using these phenotypes which can improve accuracy and substantially lessen the manual effort and time involved in screening of patients. In addition, the work provides a new solution to improve clinical data quality by imputing missing values into structured data.

Request Publication

About the Authors

Dr. Jingqing Zhang
Head of AI
Prof. YiKe Guo
Co-Founder & CSO