From c18f5f0d7674371edfac2b6f5815d72e2608b764 Mon Sep 17 00:00:00 2001
From: Michael Wornow Zero-Shot Clinical Trial Patient Matchi
@@ -262,12 +262,13 @@
The results show that GPT-4 is able to provide legitimate rationales for most its decisions. When GPT-4 makes a correct eligibility decision (Figure 4), 89% of its rationales were judged as fully correct, 8% as partially correct, and 3% as incorrect. When GPT-4 made an incorrect eligibility decision (Figure 5), its rationales were split 67/8/25%.
Interpretability
- Figure 4: Clinician assessment of whether the rationales generated by GPT-4 given eligibility decision is evaluated as correct.
+ Figure 4a (top): Clinician assessment of the rationales generated by GPT-4 for its correct eligibility decisions.
+ Figure 4b (bottom): Clinician assessment of the rationales generated by GPT-4 for its incorrect eligibility decisions.