
How we train SAEs replication #123

Merged

39 commits merged into main from how-we-train-saes-replication on May 7, 2024
Conversation

jbloomAus
Owner

Description

Please include a summary of the change and which issue is fixed. Please also include relevant motivation and context. List any dependencies that are required for this change.

Fixes # (issue)

Type of change

Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

Checklist:

  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I have not rewritten tests for key interfaces in ways that would affect backward compatibility

You have run the formatting, typing, and unit test checks (acceptance tests are not currently in use)

  • I have run `make check-ci` to check formatting and linting. (You can run `make format` to format code if needed.)

Performance Check.

If you have implemented a training change, please indicate precisely how performance changes with respect to the following metrics:

  • L0
  • CE Loss
  • MSE Loss

Please link to wandb dashboards with a control and a test group: https://api.wandb.ai/links/jbloom/syh38mi2
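As a rough illustration of the metrics listed above (a minimal sketch with a toy ReLU sparse autoencoder, not SAELens's actual implementation; all array names and shapes here are hypothetical), L0 and MSE can be computed like this:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy shapes: a batch of d_in-dimensional activations, d_sae latent features.
d_in, d_sae, batch = 8, 32, 16
W_enc = rng.normal(0, 0.1, (d_in, d_sae))
b_enc = np.zeros(d_sae)
W_dec = rng.normal(0, 0.1, (d_sae, d_in))
b_dec = np.zeros(d_in)

x = rng.normal(size=(batch, d_in))

# Encode with a ReLU nonlinearity, decode linearly.
feats = np.maximum(x @ W_enc + b_enc, 0.0)
x_hat = feats @ W_dec + b_dec

# L0: average number of active (nonzero) features per example.
l0 = float((feats > 0).sum(axis=-1).mean())

# MSE: mean squared reconstruction error.
mse = float(((x_hat - x) ** 2).mean())
```

CE loss is measured differently: the reconstructed activations are spliced back into the language model's forward pass and the resulting cross-entropy loss is compared against the unmodified model, so it is not reproduced in this toy sketch.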


codecov bot commented May 7, 2024

Codecov Report

Attention: Patch coverage is 84.39306%, with 27 lines in your changes missing coverage. Please review.

Project coverage is 64.40%. Comparing base (4d12e7a) to head (eb35e67).
Report is 1 commit behind head on main.

Files Patch % Lines
sae_lens/training/cache_activations_runner.py 80.76% 7 Missing and 3 partials ⚠️
sae_lens/training/activations_store.py 71.87% 8 Missing and 1 partial ⚠️
sae_lens/training/train_sae_on_language_model.py 75.00% 1 Missing and 2 partials ⚠️
sae_lens/training/config.py 88.23% 1 Missing and 1 partial ⚠️
sae_lens/training/evals.py 0.00% 2 Missing ⚠️
sae_lens/training/optim.py 95.65% 1 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #123      +/-   ##
==========================================
+ Coverage   59.91%   64.40%   +4.48%     
==========================================
  Files          17       17              
  Lines        1654     1753      +99     
  Branches      277      289      +12     
==========================================
+ Hits          991     1129     +138     
+ Misses        602      560      -42     
- Partials       61       64       +3     


@jbloomAus jbloomAus merged commit 5f46329 into main May 7, 2024
7 checks passed
@jbloomAus jbloomAus deleted the how-we-train-saes-replication branch May 20, 2024 13:36
tom-pollak pushed a commit to tom-pollak/SAELens that referenced this pull request Oct 22, 2024
* l1 scheduler, clip grad norm

* add provisional ability to normalize activations

* notebook

* change heuristic norm init to constant, report b_e and W_dec norms (fix tests later)

* fix mse calculation

* add benchmark test

* update heuristic init to 0.1

* make tests pass device issue

* continue rebase

* use better args in benchmark

* remove stack in get activations

* broken! improve CA runner

* get cache activation runner working and add some tests

* add training steps to path

* avoid ghost grad tensor casting

* enable download of full dataset if desired

* add benchmark for cache activation runner

* add updated tutorial

* format

---------

Co-authored-by: Johnny Lin <[email protected]>
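The first commit above mentions an L1 scheduler and gradient-norm clipping. A minimal sketch of both ideas (assumed behavior only, not the code actually merged in this PR; the function names are hypothetical):

```python
import numpy as np

def l1_warmup(step: int, warmup_steps: int, final_coeff: float) -> float:
    """Linearly ramp the L1 sparsity coefficient from 0 up to final_coeff."""
    if step >= warmup_steps:
        return final_coeff
    return final_coeff * step / warmup_steps

def clip_grad_norm(grads: list, max_norm: float) -> list:
    """Scale all gradients down so their global L2 norm is at most max_norm."""
    total_norm = np.sqrt(sum(float((g ** 2).sum()) for g in grads))
    scale = min(1.0, max_norm / (total_norm + 1e-6))
    return [g * scale for g in grads]

# Ramp the sparsity penalty over the first 100 steps, then hold it constant.
coeffs = [l1_warmup(s, 100, 1.0) for s in (0, 50, 100, 200)]

# Clip a gradient whose global norm is 6 down to a norm of ~1.
clipped = clip_grad_norm([np.full(4, 3.0)], max_norm=1.0)
```

Warming up the L1 coefficient lets the SAE first learn to reconstruct activations before the sparsity penalty starts pruning features, which tends to reduce dead features early in training.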