TY - JOUR
T1 - Scaling Clinical Trial Matching Using Large Language Models
T2 - 8th Machine Learning for Healthcare Conference, MLHC 2023
AU - Wong, Cliff
AU - Zhang, Sheng
AU - Gu, Yu
AU - Moung, Christine
AU - Abel, Jacob
AU - Usuyama, Naoto
AU - Weerasinghe, Roshanthi
AU - Piening, Brian
AU - Naumann, Tristan
AU - Bifulco, Carlo
AU - Poon, Hoifung
N1 - Publisher Copyright:
© 2023 C. Wong et al.
PY - 2023
Y1 - 2023
N2 - Clinical trial matching is a key process in health delivery and discovery. In practice, it is plagued by overwhelming unstructured data and unscalable manual processing. In this paper, we conduct a systematic study on scaling clinical trial matching using large language models (LLMs), with oncology as the focus area. Our study is grounded in a clinical trial matching system currently in test deployment at a large U.S. health network. Initial findings are promising: out of box, cutting-edge LLMs, such as GPT-4, can already structure elaborate eligibility criteria of clinical trials and extract complex matching logic (e.g., nested AND/OR/NOT). While still far from perfect, LLMs substantially outperform prior strong baselines and may serve as a preliminary solution to help triage patient-trial candidates with humans in the loop. Our study also reveals a few significant growth areas for applying LLMs to end-to-end clinical trial matching, such as context limitation and accuracy, especially in structuring patient information from longitudinal medical records.
AB - Clinical trial matching is a key process in health delivery and discovery. In practice, it is plagued by overwhelming unstructured data and unscalable manual processing. In this paper, we conduct a systematic study on scaling clinical trial matching using large language models (LLMs), with oncology as the focus area. Our study is grounded in a clinical trial matching system currently in test deployment at a large U.S. health network. Initial findings are promising: out of box, cutting-edge LLMs, such as GPT-4, can already structure elaborate eligibility criteria of clinical trials and extract complex matching logic (e.g., nested AND/OR/NOT). While still far from perfect, LLMs substantially outperform prior strong baselines and may serve as a preliminary solution to help triage patient-trial candidates with humans in the loop. Our study also reveals a few significant growth areas for applying LLMs to end-to-end clinical trial matching, such as context limitation and accuracy, especially in structuring patient information from longitudinal medical records.
UR - http://www.scopus.com/inward/record.url?scp=85177801804&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:85177801804
SN - 2640-3498
VL - 219
SP - 846
EP - 862
JO - Proceedings of Machine Learning Research
JF - Proceedings of Machine Learning Research
Y2 - 11 August 2023 through 12 August 2023
ER -