Using "beam search" strategy while generating the responses #2534
Labels
🙋 help from community wanted
Open invitation for community members to contribute
🏋 PPO
Related to PPO
Hi
I am using
flan-t5-xl
to generate the output.When I use the function
ppo_trainer.generate(....)
, it gives me the desired output but I guess it is the top beam or the best output.I am trying to generate output for 4 beams (currently using this custom generate function):
Is it possible to generate output for multiple beams using the
ppo_trainer.generate(....)
function?The text was updated successfully, but these errors were encountered: