See our paper at https://huggingface.co/papers/2405.19332.
Shenao Zhang
ZhangShenao
Collections (3)
- ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
  Text Generation • Updated • 137 • 5
- ZhangShenao/SELM-Llama-3-8B-Instruct-iter-2
  Text Generation • Updated • 10
- ZhangShenao/SELM-Llama-3-8B-Instruct-iter-1
  Text Generation • Updated • 13
- Self-Exploring Language Models: Active Preference Elicitation for Online Alignment
  Paper • 2405.19332 • Published • 16
models (353)
- ZhangShenao/math_math-Meta-Llama-3-8B-Instruct-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 7
- ZhangShenao/math_math-gemma-1.1-7b-it-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 7
- ZhangShenao/math_gsm-Meta-Llama-3-8B-Instruct-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 6
- ZhangShenao/math_math-gemma-2-9b-it-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 8
- ZhangShenao/math_gsm-gemma-1.1-7b-it-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 9
- ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-rs_nnew-sample_7500_temp_1.0_gen_30_mlr5e-5
  Updated • 9
- ZhangShenao/math_gsm-Mistral-7B-Instruct-v0.2-rs_nnew-sample_7500_temp_1.0_gen_1_mlr5e-5
  Updated • 7
- ZhangShenao/code_opencoder_edu-deepseek-coder-6.7b-instruct-rs-sample_4000_tp_gen30_temp1.0
  Updated • 21
- ZhangShenao/code_opencoder_edu-deepseek-coder-6.7b-instruct-rs-sample_4000_tp_gen1_temp1.0
  Updated • 9
- ZhangShenao/code_opencoder_edu-deepseek-coder-6.7b-instruct-rs-sample_4000_tp
  Updated • 8
datasets (211)
- ZhangShenao/rs_nnew-math_math-Meta-Llama-3-8B-Instruct-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 308 • 36
- ZhangShenao/rs_nnew-math_math-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 79 • 31
- ZhangShenao/rs_nnew-math_math-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_1_mlr5e-5
  Viewer • Updated • 2 • 59
- ZhangShenao/rs_nnew-math_gsm-Meta-Llama-3-8B-Instruct-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 6.29k • 35
- ZhangShenao/rs_nnew-math_math-gemma-2-9b-it-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 4.86k • 28
- ZhangShenao/rs_nnew-math_gsm-gemma-1.1-7b-it-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 4.78k • 32
- ZhangShenao/rs_nnew-math_gsm-Mistral-7B-Instruct-v0.2-iter_sample_7500_temp_1.0_gen_30_mlr5e-5
  Viewer • Updated • 6.26k • 37
- ZhangShenao/rs_nnew-math_gsm-Mistral-7B-Instruct-v0.2-iter_sample_7500_temp_1.0_gen_1_mlr5e-5
  Viewer • Updated • 1.87k • 66
- ZhangShenao/rs-code_opencoder_edu-deepseek-coder-6.7b-instruct-iter_sample_4000_tp_gen30_temp1.0
  Viewer • Updated • 775 • 27
- ZhangShenao/rs-code_opencoder_edu-deepseek-coder-6.7b-instruct-iter_sample_4000_tp_gen1_temp1.0
  Viewer • Updated • 774 • 30