Fix bug in calculating score when output from LLM is broken #317
If the generated answer is nonsensical, the faithfulness evaluation breaks.
For example, if you change the fake result in cell 10 of this notebook to "I love christmass", which is a completely nonsensical answer, the resulting faithfulness score is 1. This is not good. You can try it out.
I debugged this and found that when the answer is nonsensical, the 1st and 2nd outputs from the LLM are both:
There is no relevant statement that can be created from the given answer.
However, the ragas implementation assumes that the output always contains "verdict: yes" or "verdict: no".
There is more than one way to solve this issue. For example, slightly modifying the prompt template to handle nonsensical answers is one option, but that could affect the results for other answers.
So I made this PR with my suggested fix.
This PR may conflict with my PR #307. If so, I will fix the conflict!
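As a rough illustration of the failure mode (this is a standalone sketch, not the actual diff in this PR and not the real ragas code), the hypothetical function `faithfulness_score` below parses verdicts from the LLM output and explicitly handles the case where none are present. Returning 0.0 in that case is an assumption made for the example.

```python
import re

# Hypothetical pattern: matches "verdict: yes" / "verdict: no" in the LLM output.
VERDICT_RE = re.compile(r"verdict\s*:\s*(yes|no)", re.IGNORECASE)


def faithfulness_score(llm_output: str) -> float:
    """Illustrative scorer: fraction of statements with a "yes" verdict."""
    verdicts = [v.lower() for v in VERDICT_RE.findall(llm_output)]
    if not verdicts:
        # Broken or nonsensical output: no verdict lines at all.
        # Assumption for this sketch: score it 0.0 instead of letting
        # "zero 'no' verdicts" be read as a perfect score of 1.
        return 0.0
    supported = sum(v == "yes" for v in verdicts)
    return supported / len(verdicts)


broken = "There is no relevant statement that can be created from the given answer."
print(faithfulness_score(broken))
print(faithfulness_score("1. verdict: yes\n2. verdict: no"))
```

The point of the guard is that a missing verdict is treated as a separate case, rather than being counted the same way as an answer with no unsupported statements.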
Thanks