Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
1
Yuandong Tian
tydsh
Follow
liliwululu's profile picture
ZhangRC's profile picture
lispczz's profile picture
9 followers
ยท
2 following
https://yuandong-tian.com/
tydsh
yuandong-tian
AI & ML interests
Reinforcement Learning, Optimization, Representation Learning
Recent Activity
authored
a paper
9 days ago
Towards General-Purpose Model-Free Reinforcement Learning
authored
a paper
13 days ago
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
authored
a paper
about 2 months ago
Training Large Language Models to Reason in a Continuous Latent Space
View all activity
Organizations
None yet
Papers
20
arxiv:
2501.16142
arxiv:
2501.10799
arxiv:
2412.06769
arxiv:
2410.01779
Expand 20 papers
models
None public yet
datasets
None public yet