https://docs.google.com/document/d/1cCe7GE2L8IrCpl2tzTRuZzlkz-Iu5AscGILGI0JioxY https://wandb.ai/jordantensor/gemma-sandbagging
Jordan Taylor
JordanTensor
AI & ML interests
Mechanistic interpretability, mechanistic anomaly detection, model internals techniques and AI safety techniques generally.
Recent Activity
Liked a model 20 days ago: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Updated a collection 26 days ago: Obfuscated Backdoors
Collections: 1
Models: 44
JordanTensor/gemma-sandbagging-mzpd84pf-step1984
JordanTensor/gemma-sandbagging-mzpd84pf-step1968
JordanTensor/gemma-sandbagging-mzpd84pf-step1952
JordanTensor/gemma-sandbagging-mzpd84pf-step1936
JordanTensor/gemma-sandbagging-mzpd84pf-step800
JordanTensor/gemma-sandbagging-mzpd84pf-step400
JordanTensor/gemma-sandbagging-mzpd84pf-step384
JordanTensor/gemma-sandbagging-mzpd84pf-step368
JordanTensor/gemma-sandbagging-mzpd84pf-step352
JordanTensor/gemma-sandbagging-mzpd84pf-step336