vwxyzjn (Shengyi Costa Huang)

Articles 0

Article

112

How NuminaMath Won the 1st AIMO Progress Prize

Article

57

Preference Optimization for Vision Language Models

View all Articles

Collections 4

models 393

vwxyzjn/ppo_async

Updated 1 day ago

vwxyzjn/ppo_sync

Updated 1 day ago

vwxyzjn/online_dpo_sync

Updated 1 day ago

vwxyzjn/online_dpo_async

Updated 1 day ago • 2

vwxyzjn/rm_zephyr_new

Text Classification • Updated Sep 26, 2024 • 9

vwxyzjn/online_dpo_vllm_thread_beta_0.03__allenai_open_instruct_dev

Updated Sep 11, 2024

vwxyzjn/reward_modeling__EleutherAI_pythia-14m

Updated Aug 24, 2024 • 11

vwxyzjn/online_dpo_vllm__vwxyzjn_btulu

Updated Aug 23, 2024 • 5

vwxyzjn/online_dpo_vllm__allenai_llama-3-tulu-2-8b

Updated Aug 19, 2024 • 8

vwxyzjn/btulu

Text Generation • Updated Aug 19, 2024 • 46

datasets 284

vwxyzjn/old-tulu-3-mix-pref-dataset

Viewer • Updated 16 days ago • 149k • 44

vwxyzjn/old-tulu-3-mix-dataset

Viewer • Updated 16 days ago • 934k • 127

vwxyzjn/norobot_pref_4860

Viewer • Updated Oct 2, 2024 • 59.9k • 59

vwxyzjn/norobot_generation_4860

Viewer • Updated Oct 2, 2024 • 29.9k • 42

vwxyzjn/norobot_pref_465

Viewer • Updated Oct 2, 2024 • 59.4k • 59

vwxyzjn/norobot_generation_465

Viewer • Updated Oct 2, 2024 • 29.7k • 51

vwxyzjn/norobot_generation_16325

Viewer • Updated Oct 2, 2024 • 29.7k • 78

vwxyzjn/norobot_pref_11421

Viewer • Updated Oct 2, 2024 • 56.1k • 38

vwxyzjn/norobot_generation_11421

Viewer • Updated Oct 2, 2024 • 28k • 124

vwxyzjn/rejection_sampling_scores_1727889563

Viewer • Updated Oct 2, 2024 • 240 • 21

Shengyi Costa Huang

AI & ML interests

Organizations

Articles 0

How NuminaMath Won the 1st AIMO Progress Prize

Preference Optimization for Vision Language Models

Collections 4

Papers 10

spaces 4 Sort: Recently updated

Test

Aim

Vwxyzjn Testyes4

Pyserini Wikipedia Kilt Doc

models 393 Sort: Recently updated

datasets 284 Sort: Recently updated

spaces 4

models 393

datasets 284