Merge pull request #2 from owenparsons/op_edits
Suggestions for additional resources to add
koayon authored Jan 30, 2025
2 parents 7775970 + ea3e888 commit df00fd7
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions README.md
@@ -126,6 +126,10 @@ Also DeepMind -->
> A highly readable library for distributed training of sparse autoencoders with a great API.
> Mostly uses standard methods and isn't as customisable. More of a training library than a research one.
**SAELens: Joseph Bloom, Curt Tigges and David Chanin (2024)**
[code](https://github.com/jbloomAus/SAELens)

> A library designed to help researchers train sparse autoencoders, analyze them with a focus on mechanistic interpretability, and generate insights to aid in developing safe and aligned AI systems.
<!-- ## Multimodal -->
<!-- ## AI Safety -->
@@ -150,6 +154,11 @@ Also DeepMind -->

> One of the first open source SAE dictionaries available. Released alongside training code.
**Open Source Sparse Autoencoders for all Residual Stream Layers of GPT2-Small: Bloom (2024)**
[blog](https://www.lesswrong.com/posts/f9EgfLSurAiqRJySD/open-source-sparse-autoencoders-for-all-residual-stream)

> A set of 12 SAEs for the GPT2-Small residual stream. The post gives a fairly comprehensive write-up of the specific methods used.
## Other

**List of Favourite Mech Interp Papers, Neel Nanda (2024)**