Add learned activations L0 norm metric #91

lucyfarnik · 2023-11-21T19:58:39Z

Closes issue #87

lucyfarnik · 2023-11-21T20:04:40Z

Something to note on this PR: the L0NormMetric.create_progress_bar_postfix method isn't implemented yet. The abstract class didn't provide any info on what this method should do so I didn't have anything to go off of here.

alan-cooney

The L0 loss function already logs to wandb, so this doesn't add anything I'm afraid.

However if we decide we want L0 on the reconstructed neurons it would be reasonable to add this.

Btw I've also removed create_progress_bar_postfix and renamed create_weights_and_biases_log to just calculate.

lucyfarnik · 2023-11-23T07:45:31Z

I'm slightly confused, I'm pretty sure the only Lp norm that's getting logged to wandb is the L1? At least this was what I was seeing when running the code before I added these features in:

Re removing create_progress_bar_postfix: sounds great! I think that was the right call

alan-cooney

Ignore my previous review - was mixing things up!

I've added some suggestions.

I've also merged in main already and fixed the merge conflicts for you, as that was a result of me refactoring.

sparse_autoencoder/metrics/train/l0_norm_metric.py

sparse_autoencoder/metrics/train/tests/test_abstract_metric.py

sparse_autoencoder/metrics/train/tests/test_l0_norm_metric.py

--------- Co-authored-by: Alan Cooney <[email protected]>

lucyfarnik added 2 commits November 21, 2023 19:56

Added L0 norm metric

93077bb

Sorted import order

7b4fcf2

Formatting fixed

c2de5d3

alan-cooney requested changes Nov 22, 2023

View reviewed changes

alan-cooney added 2 commits November 23, 2023 09:29

Merge branch 'main' into pr/lucyfarnik/91

678c897

Fix merge conflicts

5d21e5c

alan-cooney requested changes Nov 23, 2023

View reviewed changes

sparse_autoencoder/metrics/train/l0_norm_metric.py Outdated Show resolved Hide resolved

sparse_autoencoder/metrics/train/tests/test_abstract_metric.py Outdated Show resolved Hide resolved

sparse_autoencoder/metrics/train/tests/test_l0_norm_metric.py Outdated Show resolved Hide resolved

alan-cooney added 2 commits November 26, 2023 18:58

Merge branch 'main' into pr/lucyfarnik/91

49efc25

Improve readability

bd79bc1

alan-cooney changed the title ~~Added L0 norm metric~~ Add learned activations L0 norm metric Nov 27, 2023

alan-cooney added 2 commits November 26, 2023 19:07

Remove the abstract metric tests

05ef8b9

Add to default outputs

44e96df

alan-cooney approved these changes Nov 27, 2023

View reviewed changes

alan-cooney merged commit 0b3337b into ai-safety-foundation:main Nov 27, 2023

HoagyC pushed a commit to HoagyC/sparse_autoencoder that referenced this pull request Dec 13, 2023

Add learned activations L0 norm metric (ai-safety-foundation#91)

c38860d

--------- Co-authored-by: Alan Cooney <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add learned activations L0 norm metric #91

Add learned activations L0 norm metric #91

lucyfarnik commented Nov 21, 2023

lucyfarnik commented Nov 21, 2023

alan-cooney left a comment

lucyfarnik commented Nov 23, 2023

alan-cooney left a comment

Add learned activations L0 norm metric #91

Add learned activations L0 norm metric #91

Conversation

lucyfarnik commented Nov 21, 2023

lucyfarnik commented Nov 21, 2023

alan-cooney left a comment

Choose a reason for hiding this comment

lucyfarnik commented Nov 23, 2023

alan-cooney left a comment

Choose a reason for hiding this comment