feat(model,eval) DPO training & human preference eval #338

BobbyL2k · 2024-02-13T11:14:39Z

Why this PR

This PR adds DPO training scripts and human preference evaluation notebook to repo

Changes

Added DPO training scripts and human preference evaluation notebook

Related Issues

Close #

Checklist

PR should be in the Naming convention
Assign yourself in to Assigneees
Tag related issues
Constants name should be ALL_CAPITAL, function name should be snake_case, and class name should be CamelCase
complex function/algorithm should have Docstring
1 PR should not have more than 200 lines changes (Exception for test files). If more than that please open multiple PRs
At least PR reviewer must come from the task's team (model, eval, data)

codecov · 2024-02-13T11:20:29Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (5fbee96) 64.16% compared to head (54425cb) 19.39%.
Report is 1 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##             main     #338       +/-   ##
===========================================
- Coverage   64.16%   19.39%   -44.78%     
===========================================
  Files          11       25       +14     
  Lines         427     1392      +965     
===========================================
- Hits          274      270        -4     
- Misses        153     1122      +969

Flag	Coverage Δ
unittests	`19.39% <ø> (-44.78%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

DPO training & human preference eval

54425cb

BobbyL2k assigned new5558 and ArthurMinovsky Feb 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(model,eval) DPO training & human preference eval #338

feat(model,eval) DPO training & human preference eval #338

BobbyL2k commented Feb 13, 2024

codecov bot commented Feb 13, 2024

feat(model,eval) DPO training & human preference eval #338

Are you sure you want to change the base?

feat(model,eval) DPO training & human preference eval #338

Conversation

BobbyL2k commented Feb 13, 2024

Why this PR

Changes

Related Issues

Checklist

codecov bot commented Feb 13, 2024

Codecov Report