tttx/sft_r1_7b
Tags: PEFT, Safetensors, tttx/r1-trajectories-collection-round-2, tttx/r1-trajectories-arcagi-barc, qwen2, alignment-handbook, trl, sft, Generated from Trainer
License: mit
Branch: main
Commit History
81e7f2a  End of training (verified), aadityap committed 11 days ago
9282164  Model save (verified), aadityap committed 11 days ago
5bdc6c4  Training in progress, epoch 3 (verified), aadityap committed 11 days ago
f047e2d  Training in progress, epoch 2 (verified), aadityap committed 11 days ago
a9b3e2e  Training in progress, epoch 1 (verified), aadityap committed 11 days ago
4a6cb1e  End of training (verified), aadityap committed 12 days ago
215344e  Model save (verified), aadityap committed 12 days ago
c8b2662  initial commit (verified), aadityap committed 12 days ago