GitHub - HarryMayne/SV_interpretability: Code for the paper "Can sparse autoencoders be used to decompose and interpret steering vectors?"

Can sparse autoencoders be used to decompose and interpret steering vectors?

Code coming soon!
