Fix inconsistent device config in finetuning and serving yaml #25

carsonwang · 2024-01-03T10:18:37Z

closes #11

jiafuzha

LGTM

* [rlhf] adapt to ray 2.5.0
* [rlhf] add post_init for RewardModel
* [rlhf] support neox reward model
* [rlhf] adapt values dims
* add tensorboard for rm trainer
* remove duplicated code of reward model

update device value

28e2d3d

carsonwang requested a review from jiafuzha January 4, 2024 02:22

jiafuzha approved these changes Jan 4, 2024

carsonwang merged commit 91f3756 into intel:main Jan 4, 2024
6 checks passed
