diff --git a/index.html b/index.html
index 3de50237..8a401ae2 100644
--- a/index.html
+++ b/index.html
@@ -77,7 +77,7 @@
Zero-Shot Clinical Trial Patient Matchi
- Stanford University
Under Review
+ Stanford University
NEJM AI 2024
*Indicates Equal Contribution
@@ -262,12 +262,13 @@
Interpretability
The results show that GPT-4 is able to provide legitimate rationales for most its decisions. When GPT-4 makes a correct eligibility decision (Figure 4), 89% of its rationales were judged as fully correct, 8% as partially correct, and 3% as incorrect. When GPT-4 made an incorrect eligibility decision (Figure 5), its rationales were split 67/8/25%.
- Figure 4: Clinician assessment of whether the rationales generated by GPT-4 given eligibility decision is evaluated as correct.
+ Figure 4a (top): Clinician assessment of the rationales generated by GPT-4 for its correct eligibility decisions.
+ Figure 4b (bottom): Clinician assessment of the rationales generated by GPT-4 for its incorrect eligibility decisions.
-
+