Vince's picture

750 52

Vince

bolerovt

·

bolerovt

AI & ML interests

None yet

Recent Activity

upvoted a paper 7 days ago

The Differences Between Direct Alignment Algorithms are a Blur

upvoted a paper 7 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

upvoted a paper 10 days ago

Process Reinforcement through Implicit Rewards

View all activity

Organizations

None yet

bolerovt's activity

upvoted 2 papers 7 days ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published 11 days ago • 112

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 11 days ago • 171

upvoted a paper 10 days ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 11 days ago • 53

upvoted 2 papers 11 days ago

Large Language Models Think Too Fast To Explore Effectively

Paper • 2501.18009 • Published 16 days ago • 22

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 15 days ago • 52

upvoted 6 papers 12 days ago

Chain-of-Retrieval Augmented Generation

Paper • 2501.14342 • Published 21 days ago • 50

Humanity's Last Exam

Paper • 2501.14249 • Published 21 days ago • 61

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published 17 days ago • 21

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 17 days ago • 104

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 16 days ago • 53

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 15 days ago • 81

upvoted an article 12 days ago

Article

Open-R1: Update #1

By

and 7 others •

13 days ago

• 276

upvoted 8 papers 23 days ago

EnerVerse: Envisioning Embodied Future Space for Robotics Manipulation

Paper • 2501.01895 • Published Jan 3 • 50

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

Paper • 2501.02976 • Published Jan 6 • 52

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides

Paper • 2501.03936 • Published Jan 7 • 19

Cosmos World Foundation Model Platform for Physical AI

Paper • 2501.03575 • Published Jan 7 • 68

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7 • 42

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 90

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published Jan 8 • 84

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published Jan 8 • 89