Tanmay Gangwani
tgangs
AI & ML interests
None yet
Recent Activity
upvoted a paper 3 days ago
Analyze Feature Flow to Enhance Interpretation and Steering in Language Models
updated a collection 11 days ago
RL papers
updated a collection 13 days ago
RL papers
Organizations
None yet
Collections
2
- RL Zero: Zero-Shot Language to Behaviors without any Supervision
  Paper • 2412.05718 • Published • 4
- Offline Reinforcement Learning for LLM Multi-Step Reasoning
  Paper • 2412.16145 • Published • 38
- Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
  Paper • 2412.15797 • Published • 18
- Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search
  Paper • 2412.18319 • Published • 37
models
None public yet
datasets
None public yet