Collect unpaired preference feedback #395

RobotSail · 2024-12-07T04:05:11Z

In order to further fine-tune language models and align them with human preferences, it's necessary to collect preference feedback on model responses. There are a few types of data points to collect: unpaired and paired.

Unpaired preference data is used for unpaired preference optimization, as described by the following paper: UPO: Unpaired Preference Optimization for Large Language Models.

In order to collect data for this form of fine-tuning, we want to introduce a thumbs up/down button that appears on each assistant response. When the user presses this button, we want to record the following information:

immediate model response
previous user message
conversation ID
model ID

For instance, consider how the UI appears in the following popular chat assistant:

Example of a response with thumbs up/down buttons:

Example of the thumbs up/down buttons:

This issue depends on #13 and #394

RobotSail mentioned this issue Dec 7, 2024

[Epic] Data Pipeline for RLHF Tuning #392

Open

4 tasks

vishnoianil added the enhancement label Dec 17, 2024

vishnoianil added this to UI Dec 17, 2024

vishnoianil added this to the release-1.2 milestone Dec 17, 2024

vishnoianil moved this to Backlog in UI Dec 17, 2024

vishnoianil added help wanted Extra attention is needed and removed enhancement labels Feb 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Collect unpaired preference feedback #395

Collect unpaired preference feedback #395

RobotSail commented Dec 7, 2024 •

edited

Loading

Collect unpaired preference feedback #395

Collect unpaired preference feedback #395

Comments

RobotSail commented Dec 7, 2024 • edited Loading

RobotSail commented Dec 7, 2024 •

edited

Loading