

wongyukim · kimwongyuda

AI & ML interests

None yet


Organizations

None yet

wongyukim's activity

upvoted 2 papers about 13 hours ago

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Paper • 2502.01720 • Published 3 days ago • 4

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Paper • 2502.02508 • Published 2 days ago • 16

upvoted 5 papers 1 day ago

MM-IQ: Benchmarking Human-Like Abstraction and Reasoning in Multimodal Models

Paper • 2502.00698 • Published 4 days ago • 21

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published 3 days ago • 31

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published 3 days ago • 53

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 4 days ago • 152

s1: Simple test-time scaling

Paper • 2501.19393 • Published 6 days ago • 88

upvoted a paper 4 days ago

GME: Improving Universal Multimodal Retrieval by Multimodal LLMs

Paper • 2412.16855 • Published Dec 22, 2024 • 2

upvoted 3 papers 6 days ago

o3-mini vs DeepSeek-R1: Which One is Safer?

Paper • 2501.18438 • Published 7 days ago • 21

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published 7 days ago • 49

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 7 days ago • 78

liked a Space 6 days ago

MMEB Leaderboard

The massive multimodal embedding benchmark

upvoted 2 papers 6 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 8 days ago • 50

Atla Selene Mini: A General Purpose Evaluation Model

Paper • 2501.17195 • Published 10 days ago • 30

upvoted 2 papers 7 days ago

Optimizing Large Language Model Training Using FP4 Quantization

Paper • 2501.17116 • Published 9 days ago • 32

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 9 days ago • 100

upvoted 3 papers 8 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 10 days ago • 24

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 12 days ago • 54

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published 12 days ago • 53

upvoted a paper 9 days ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 14 days ago • 22