This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
medalign [2023/08/24 17:29] nigam |
medalign [2023/08/31 12:37] (current) jfries |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records ====== | ====== MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records ====== | ||
- | |||
The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture the complexity of information needs and documentation burdens experienced by clinicians. To address these challenges, we introduce MedAlign, a benchmark dataset of 983 natural language instructions for EHR data. MedAlign is curated by 15 clinicians (7 specialities), | The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture the complexity of information needs and documentation burdens experienced by clinicians. To address these challenges, we introduce MedAlign, a benchmark dataset of 983 natural language instructions for EHR data. MedAlign is curated by 15 clinicians (7 specialities), | ||
+ | |||
+ | See our preprint for more details. | ||
+ | |||
+ | **[[https:// | ||
+ | |||
+ |