Fix bug in calculating score when output from LLM is broken #317

Yongtae723 · 2023-11-21T06:03:57Z

If generated answer is too stupid, evaluation of faithfull will be broken.

For example, if you change fake result into "I love christmass" ,which is completely nun sense answer on cell 10 on this notebook, the result of faithfullness will be 1. This is not good. You can try it out.

I debug the reason and showed like if the result is too stupid,
1st and 2nd output from LLM will be
There is no relevant statement that can be created from the given answer.
but ragas implementation is based the assumption there are "verdict: yes" or "verdict: no".

The way to solve this issue is not only one. for example, slitly modify template prompt for stupid answer is one example, but it can be affect the result.

So I make pr for my suggestion.

Maybe this PR will conflict my PR #307, If so I will fix conflict!

Thanks

shahules786 · 2023-11-21T11:58:03Z

Thanks @Yongtae723 , This makes sense. I'll check this while I modify faithfulness prompts + demonstrations as mentioned here.

Yongtae723 · 2023-11-21T12:07:52Z

alright, Make sense. Forcing json output might solve this issue. I am looking forward to hear your experiment result. And if possible, I would like to check it works as our expectation.

So if possible let me know your PR!

Also closes PRs #307 & #317

Yongtae723 · 2023-11-27T06:54:21Z

Thanks! and sorry for my late responce

I checked prompt and tested.
I found typo of prompt, so I created new PR(I guess) #338

Fix bug in calculating score when output from LLM is broken

0962c32

Yongtae723 mentioned this pull request Nov 21, 2023

modify first prompt of faithfullness #321

Closed

shahules786 mentioned this pull request Nov 24, 2023

fix: structure faithfulness output #333

Merged

jjmachan pushed a commit that referenced this pull request Nov 24, 2023

fix: structure faithfulness output (#333)

929414d

Also closes PRs #307 & #317

Yongtae723 closed this Nov 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix bug in calculating score when output from LLM is broken #317

Fix bug in calculating score when output from LLM is broken #317

Yongtae723 commented Nov 21, 2023

shahules786 commented Nov 21, 2023

Yongtae723 commented Nov 21, 2023

Yongtae723 commented Nov 27, 2023

Fix bug in calculating score when output from LLM is broken #317

Fix bug in calculating score when output from LLM is broken #317

Conversation

Yongtae723 commented Nov 21, 2023

shahules786 commented Nov 21, 2023

Yongtae723 commented Nov 21, 2023

Yongtae723 commented Nov 27, 2023