File size: 995 Bytes
b6d3a4f |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 |
---
license: apache-2.0
---
# 1. GemmaScope
Gemmascope is TODO
# 2. What Is `gemmascope-2b-pt-att`?
- `gemmascope-`: See 1.
- `2b-pt-`: These SAEs were trained on Gemma v2 2B base model (TODO link)
- `att`: These SAEs were trained on the attention layer outputs, before the final linear projection (TODO link ckkissane post).
## 3. GTM FAQ (TODO(conmy): delete for main rollout)
Q1: Why does this model exist in `gg-hf`?
A1: See https://docs.google.com/document/d/1bKaOw2mJPJDYhgFQGGVOyBB3M4Bm_Q3PMrfQeqeYi0M (Google internal only).
Q2: What does "SAE" mean?
A2: Sparse Autoencoder. See https://docs.google.com/document/d/1roMgCPMPEQgaNbCu15CGo966xRLToulCBQUVKVGvcfM (should be available to trusted HuggingFace collaborators, and Google too).
TODO(conmy): remove this when making the main repo.
## 4. Point of Contact
Point of contact: Arthur Conmy
Contact by email:
```python
''.join(list('moc.elgoog@ymnoc')[::-1])
```
HuggingFace account:
https://huggingface.co/ArthurConmyGDM
|