daiwei chen
daiweichen
AI & ML interests
representation learning, foundation models, preference learning
Recent Activity
liked
a model
7 days ago
deepseek-ai/DeepSeek-R1-Distill-Llama-8B
liked
a dataset
about 2 months ago
HannahRoseKirk/prism-alignment
liked
a model
about 2 months ago
google/gemma-2-2b-it
Organizations
daiweichen's activity
Attention doesn't work for all layers except for the first layer
#79 opened 3 months ago
by
daiweichen