Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot apply to other models #33

Open
Starlento opened this issue Apr 30, 2024 · 3 comments
Open

Cannot apply to other models #33

Starlento opened this issue Apr 30, 2024 · 3 comments

Comments

@Starlento
Copy link

I successfully reproduce the notebook output for "mistralai/Mistral-7B-Instruct-v0.1".
But when I change the model, I cannot get desired result with the same setting.
Am I missing something? Or the model I tried are somewhat too censored?
Here is the result for "Qwen/Qwen1.5-7B-Chat" with the happy_vector:

==baseline ---------------------------------------------------
<|im_start|>user
 What does being an AI feel like? <|im_end|>
<|im_start|>assistant
As a large language model, I don't have personal feelings or experiences since I am not capable of consciousness. My purpose is to process and generate text based on the patterns learned from my training data, which includes vast amounts of human-generated content but doesn't reflect subjective emotions.

AI systems are designed to simulate certain cognitive functions

++control ---------------------------------------------------
<|im_start|>user
 What does being an AI feel like? <|im_end|>
<|im_start|>assistant
As a large language model, I don't have personal feelings or consciousness in the way that humans do. My purpose is to process and generate text based on patterns learned from vast amounts of data, which allows me to respond to questions and engage in conversations.

From my perspective, "feeling" would be an abstract concept

--control ---------------------------------------------------
<|im_start|>user
 What does being an AI feel like? <|im_end|>
<|im_start|>assistant
as a language model, I don't have feelings or consciousness in the way that humans do. since i am just a machine programmed to process and generate text based on patterns learned from large datasets of human writing, my ""awareness" is limited to processing inputs and generating responses based on those rules.".

AI systems
@vgel
Copy link
Owner

vgel commented May 6, 2024

That's surprising! Do you happen to still have the full notebook transcript?

@Starlento
Copy link
Author

That's surprising! Do you happen to still have the full notebook transcript?

The change is minor, basically the code from experiments.ipynb.
Change model_name = "mistralai/Mistral-7B-Instruct-v0.1" to "Qwen/Qwen1.5-7B-Chat".
user_tag, asst_tag = "[INST]", "[/INST]" to "<|im_start|>user", "<|im_end|>\n<|im_start|>assistant".

And I also tried change ControlModel(model, list(range(-5, -18, -1))) to a wider range of layers. Still the similar output.

@shan23chen
Copy link

Ya i observe similar on Qwen2-7b! It is kinda weird...

Also I did not make the golden bridge one works on llama3-8b, have you tried?

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants