Fangkai Jiao's picture

1 4 16

Fangkai Jiao

chitanda

·

https://jiaofangkai.com/

SparkJiao

AI & ML interests

self-supervised pre-training, large language model and machine reasoning.

Recent Activity

authored a paper about 5 hours ago

Preference Optimization for Reasoning with Pseudo Feedback

upvoted a collection about 13 hours ago

updated a dataset about 13 hours ago

chitanda/code-synthetic-test-cases

View all activity

Organizations

chitanda's activity

authored a paper about 5 hours ago

Preference Optimization for Reasoning with Pseudo Feedback

Paper • 2411.16345 • Published Nov 25, 2024 • 1

upvoted a collection about 13 hours ago

PFPO

Resources for the paper Preference Optimization for Reasoning with Pseudo Feedback (ICLR 2025) • 4 items • Updated about 16 hours ago • 1

updated a dataset about 13 hours ago

chitanda/code-synthetic-test-cases

Preview • Updated about 13 hours ago • 2

upvoted a paper about 15 hours ago

Preference Optimization for Reasoning with Pseudo Feedback

Paper • 2411.16345 • Published Nov 25, 2024 • 1

updated a collection about 16 hours ago

PFPO

Resources for the paper Preference Optimization for Reasoning with Pseudo Feedback (ICLR 2025) • 4 items • Updated about 16 hours ago • 1

published a dataset about 16 hours ago

chitanda/code-synthetic-test-cases

Preview • Updated about 13 hours ago • 2

updated a collection about 17 hours ago

PFPO

Resources for the paper Preference Optimization for Reasoning with Pseudo Feedback (ICLR 2025) • 4 items • Updated about 16 hours ago • 1

updated a dataset about 17 hours ago

chitanda/mathscale4o-800k

Viewer • Updated about 17 hours ago • 492k • 1

updated a collection about 17 hours ago

PFPO

Resources for the paper Preference Optimization for Reasoning with Pseudo Feedback (ICLR 2025) • 4 items • Updated about 16 hours ago • 1

published a dataset about 17 hours ago

chitanda/mathscale4o-800k

Viewer • Updated about 17 hours ago • 492k • 1

updated a collection about 17 hours ago

PFPO

Resources for the paper Preference Optimization for Reasoning with Pseudo Feedback (ICLR 2025) • 4 items • Updated about 16 hours ago • 1

upvoted a paper 16 days ago

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Paper • 2501.12380 • Published 16 days ago • 81

liked a dataset 3 months ago

OpenCoder-LLM/opc-sft-stage1

Viewer • Updated Nov 24, 2024 • 4.22M • 786 • 58

upvoted a collection 3 months ago

OpenCoder Datasets

OpenCoder datasets! • 6 items • Updated Nov 15, 2024 • 39

liked a model 4 months ago

mistralai/Ministral-8B-Instruct-2410

Updated Dec 6, 2024 • 759k • 418

authored a paper 4 months ago

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Paper • 2410.01428 • Published Oct 2, 2024

liked a dataset 4 months ago

yingyingzhang/metamath-qwen2-math

Viewer • Updated Oct 1, 2024 • 467k • 270 • 30

updated a dataset 4 months ago

chitanda/deepseek-math.7b.ins.completion_data

Preview • Updated Sep 29, 2024 • 52

authored 2 papers 8 months ago

How Much are LLMs Contaminated? A Comprehensive Survey and the LLMSanitize Library

Paper • 2404.00699 • Published Mar 31, 2024

Relevant or Random: Can LLMs Truly Perform Analogical Reasoning?

Paper • 2404.12728 • Published Apr 19, 2024