
Enhancements #55

Open
wants to merge 26 commits into main

Conversation

thiswillbeyourgithub

Hi!

I'm the dev who made a research fork of repeng to explore some questions about what's possible with repeng and repeng-like techniques. I first mentioned my ideas in these messages

The project is still ongoing, but as I kept adding more and more little enhancements, it felt like a good idea to try to upstream some of them.

Notably, I added a utils.py file with a make_dataset function, joblib caching for the activations, support for chat templates (with autocorrection), l1/l2 norm selection, documentation, more control over which layers to target, and numpy v2 compatibility.

Note that I couldn't run the test file because of import issues with flash attention.

I also did not add any modules to the requirements yet.

I mainly wanted your feedback on this.

Also thanks a lot for making repeng, it's really nice and really allowed me to get my hands dirty!

Commits:

  • add file settings.py to store settings
  • add a utils.py file that contains a make_dataset function
  • move DatasetEntry to utils.py
  • feat: give more controls to how the user can select the layers to modify
  • feat: add joblib memory for caching model activations
  • fix: numpy v2 compatibility
  • feat: add arg norm_type when training that allows setting l1 or l2 (or other) norm
  • feat: add function to accept chat template as inputs + autocorrect
  • fix: forgot an import
  • minor: add tqdm for getting activations and applying the new directions
  • perf: potentially faster one liner to tokenize
  • feat: use flag LOW_MEMORY to reduce the amount of memory needed when storing the arrays before computing directions
  • fix: imports
  • update default model in example from mistral 7B v0.1 to v0.3
  • docs: add more details on how to load the model, including quantization, avoiding OOM, etc
  • docs: use chat messages in example
  • docs: in example, show how to login for models that require it
  • docs: add missing declaration of tokenizer
  • fix: in example, move the tokens directly to the correct device
  • minor: changed default in the example
  • docs: add link to github issue about OOM for gguf files
  • docs: add more examples of models
  • fix: autocorrect chat templates

allows setting the verbosity, low_memory flags, etc.

Signed-off-by: thiswillbeyourgithub <[email protected]>
now the layer_ids arg can be either a list of int (the indexes of the
layers), "all" for all layers, "middle" for the middle half,
"only_middle" for the single layer at the middle of the model, or a
range in the form "0.3-0.7" (= layers from 30% to 70% of the depth)

Signed-off-by: thiswillbeyourgithub <[email protected]>
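A minimal sketch of how a layer spec like the one described in that commit could be resolved into concrete layer indexes. The function name and exact boundary arithmetic are illustrative assumptions, not the actual repeng API:

```python
def resolve_layer_ids(spec, num_layers):
    """Resolve a layer spec (list, 'all', 'middle', 'only_middle', or
    a fractional range like '0.3-0.7') into a list of layer indexes."""
    if isinstance(spec, (list, range)):
        return list(spec)
    if spec == "all":
        return list(range(num_layers))
    if spec == "middle":
        # middle half: from 25% to 75% of the depth
        return list(range(num_layers // 4, (3 * num_layers) // 4))
    if spec == "only_middle":
        # the single layer at the middle of the model
        return [num_layers // 2]
    if isinstance(spec, str) and "-" in spec:
        # fractional range, e.g. "0.3-0.7" = from 30% to 70% of the depth
        lo, hi = (float(x) for x in spec.split("-"))
        return list(range(int(lo * num_layers), int(hi * num_layers)))
    raise ValueError(f"Unrecognized layer spec: {spec!r}")
```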
should close vgel#51

Signed-off-by: thiswillbeyourgithub <[email protected]>
'autocorrect' meaning that we check that each message is indeed present
in the output after applying the chat template. This was needed because
there are a lot of cases where models (including official releases from
GAFAM!) have sketchy chat template implementations that silently drop
system prompts, etc.

Signed-off-by: thiswillbeyourgithub <[email protected]>
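An illustrative sketch (not the actual repeng code) of the check described in that commit: after rendering a chat template, verify that the content of every message actually appears in the rendered string, so templates that silently drop messages are caught. The helper name and the example template markers are hypothetical:

```python
def find_dropped_messages(messages, rendered):
    """Return the messages whose content does not appear in the
    rendered chat template output (i.e. was silently dropped)."""
    return [m for m in messages if m["content"] not in rendered]

# Example: a template that silently drops the system prompt.
messages = [
    {"role": "system", "content": "You are terse."},
    {"role": "user", "content": "Hi!"},
]
rendered = "<|user|>Hi!<|assistant|>"  # hypothetical template output
dropped = find_dropped_messages(messages, rendered)
# dropped contains the system message, so we know the template is sketchy
```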
…storing the arrays before computing directions

Signed-off-by: thiswillbeyourgithub <[email protected]>
…on, avoiding OOM, etc

Signed-off-by: thiswillbeyourgithub <[email protected]>
@vgel
Owner

vgel commented Dec 14, 2024

This is really cool! My first thought is there's probably too much going on here for one PR, but let me look it over more closely and figure out how we can split this up. I've also been considering adding a utils file for a similar reason as you--I might make a PR for that soon-ish and tag you in.

@vgel
Owner

vgel commented Dec 14, 2024

I'm also currently migrating the library from poetry to uv, so no rush on adding dependencies!

@thiswillbeyourgithub
Author

thiswillbeyourgithub commented Dec 14, 2024

This is really cool! My first thought is there's probably too much going on here for one PR, but let me look it over more closely and figure out how we can split this up. I've also been considering adding a utils file for a similar reason as you--I might make a PR for that soon-ish and tag you in.

Glad you like it! I take this opportunity to thank you once again for making repeng, it's been super helpful for getting me started on many ideas I had.

I'm fine with splitting this up, but I do care about my contribution being credited: I'm a self-taught medical student (soon psychiatry resident!), so I want my interests and contributions to appear on my GitHub profile for visibility :)

In any case, I keep breaking and unbreaking my fork as I'm continuously experimenting (since yesterday I've been playing around with filtering samples that are retrievable after a 2-cluster k-means on the UMAP projection, and I feel good about this one!). Apart from the read_representation method, I think the code is mostly stable. I'm sometimes unsure about my chat templating, but that might be due to errors in the models' original templates too, so I would appreciate a second look on this if you have the bandwidth.

Also, I couldn't run the tests on my machine; I think it would really help to know whether my modifications are stable and free of side effects.

edit: actually, I was able to run the tests. But as I made extensive changes to them, I'll keep them in my fork for now.

@vgel
Owner

vgel commented Jan 8, 2025

definitely, will absolutely make sure you're credited on anything that gets merged :-)

@vgel
Owner

vgel commented Jan 8, 2025

here, a good initial thing to split off would be the ability to pass a string for layer_ids in ControlModel.__init__. i think we should have three options:

  • a list / range object, like we currently have
  • "all"
  • "middle", which should keep the middle 2/3 (instead of the middle half as it does now)

then in the docstring, we should suggest people use "middle" as the default. i don't think we should support None, because people should be explicit about what layers are being controlled. and i think "only_middle" and the range stuff are marginal enough that we don't need to support them by default--people can always calculate it themselves if they need it, or we can add it later.
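The proposal above could be sketched as follows (illustrative only, assuming a hypothetical helper name, not the merged API): accept a list/range as-is, "all", or "middle" meaning the middle 2/3 of the layers.

```python
def normalize_layer_ids(layer_ids, num_layers):
    """Normalize layer_ids: a list/range is passed through; 'all' means
    every layer; 'middle' keeps the middle 2/3 of the layers."""
    if isinstance(layer_ids, (list, range)):
        return list(layer_ids)
    if layer_ids == "all":
        return list(range(num_layers))
    if layer_ids == "middle":
        # keep the middle 2/3: drop the first and last sixth of the layers
        sixth = num_layers // 6
        return list(range(sixth, num_layers - sixth))
    raise ValueError(
        f"layer_ids must be a list, range, 'all', or 'middle', got {layer_ids!r}"
    )
```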

if you could split that off into a new PR, i can help make sure the tests are passing etc so we can get it merged, if that sounds good?

@thiswillbeyourgithub
Author

Sounds good to me, thanks! I'll do that when I have some time in the next week
