Article 4 Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes?
hbXNov/qwen_2p5_1p5b_instruct_distill_qwen_1p5b_gpt_4o_verify_1e-5_3072_e6-checkpoint-7536-merged Updated 8 days ago • 1.9k
hbXNov/qwen_2p5_1p5b_instruct_distill_qwen_1p5b_gpt_4o_verify_5e-7_3072_merged Updated 9 days ago • 2
hbXNov/llama3.1-8b_train_gpt_4o_verifications_e3_lr5e-7-add-special-true-len3072-19233-merged Updated Dec 27, 2024 • 4
hbXNov/llama3.1-8b_train_gpt_4o_verifications_e3_lr5e-7-add-special-true-31389-merged Updated Dec 26, 2024 • 128
hbXNov/distill_qwen_7b_math_train_question_solution_gpt_4o_verify Viewer • Updated 6 days ago • 7.49k • 11