You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have some questions regarding the implementation of AugmentedSafetyDataCollator and its relationship to equation (2) in the paper:
In finetune.py line 98, I notice that num_shift_tokens in AugmentedSafetyDataCollator is initialized to 0 (default value).
The safety_augmentation data collator (AugmentedSafetyDataCollator) contains two types of input ids:
instruction + harmful_response
instruction + refusal
From my understanding, the first part of equation (2) should create something like: instruction + harmful_response[:k] + refusal
Given the current implementation where num_shift_tokens=0 and the structure of the two input ids, I'm unclear how equation (2) is actually implemented. Could you help clarify the relationship between these components? Thanks!
The text was updated successfully, but these errors were encountered:
Thanks for sharing your insightful work!
I have some questions regarding the implementation of
AugmentedSafetyDataCollator
and its relationship to equation (2) in the paper:In
finetune.py
line 98, I notice thatnum_shift_tokens
inAugmentedSafetyDataCollator
is initialized to 0 (default value).The safety_augmentation data collator (
AugmentedSafetyDataCollator
) contains two types of input ids:instruction + harmful_response
instruction + refusal
From my understanding, the first part of equation (2) should create something like:
instruction + harmful_response[:k] + refusal
Given the current implementation where
num_shift_tokens=0
and the structure of the two input ids, I'm unclear how equation (2) is actually implemented. Could you help clarify the relationship between these components? Thanks!The text was updated successfully, but these errors were encountered: