namgoodfire
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -22,12 +22,22 @@ By open-sourcing SAEs for leading open models, especially large-scale
|
|
22 |
models like Llama 3.3 70B, we aim to accelerate progress in interpretability research.
|
23 |
|
24 |
Our initial work with these SAEs has revealed promising applications in model steering,
|
25 |
-
enhancing jailbreaking safeguards, and interpretable classification methods
|
26 |
We look forward to seeing how the research community builds upon these
|
27 |
foundations and uncovers new applications.
|
28 |
|
29 |
#### Feature labels
|
30 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
31 |
## How to use
|
32 |
|
33 |
```python
|
|
|
22 |
models like Llama 3.3 70B, we aim to accelerate progress in interpretability research.
|
23 |
|
24 |
Our initial work with these SAEs has revealed promising applications in model steering,
|
25 |
+
enhancing jailbreaking safeguards, and interpretable classification methods.
|
26 |
We look forward to seeing how the research community builds upon these
|
27 |
foundations and uncovers new applications.
|
28 |
|
29 |
#### Feature labels
|
30 |
|
31 |
+
To explore the feature labels check out the [Goodfire Ember SDK](https://www.goodfire.ai/blog/announcing-goodfire-ember/).
|
32 |
+
The SDK provides an intuitive interface for interacting with these
|
33 |
+
features, allowing you to investigate how Llama processes information
|
34 |
+
and even steer its behavior. Get started with feature
|
35 |
+
exploration at [docs.goodfire.ai](https://docs.goodfire.ai) or install directly via:
|
36 |
+
|
37 |
+
```
|
38 |
+
pip install goodfire
|
39 |
+
```
|
40 |
+
|
41 |
## How to use
|
42 |
|
43 |
```python
|