Yuandong Tian's picture

4 1

Yuandong Tian

tydsh

·

https://yuandong-tian.com/

AI & ML interests

Reinforcement Learning, Optimization, Representation Learning

Recent Activity

authored a paper 9 days ago

Towards General-Purpose Model-Free Reinforcement Learning

authored a paper 13 days ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

authored a paper about 2 months ago

Training Large Language Models to Reason in a Continuous Latent Space

View all activity

Organizations

None yet

Papers 20

arxiv:2501.16142

arxiv:2501.10799

arxiv:2412.06769

arxiv:2410.01779

models

None public yet

datasets

None public yet