wattai

AI & ML interests

I'm interested in generating BMS charts from text and music prompts.

Organizations

None yet

wattai's activity

upvoted a paper 1 day ago

Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning

Paper • 2502.06060 • Published 5 days ago • 29

liked a dataset 3 days ago

openai/gsm8k

Viewer • Updated Jan 4, 2024 • 17.6k • 262k • 573

upvoted an article 3 days ago

Open R1: Update #2

Article • and 6 others • 4 days ago • 154

liked a model 4 days ago

grapevine-AI/DeepSeek-R1-Distill-Qwen-32B-Japanese-GGUF

Updated 17 days ago • 1.43k • 3

liked a model 5 days ago

cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese

Text Generation • Updated 18 days ago • 14.4k • 234

liked a model 9 days ago

llm-jp/llm-jp-3-3.7b-instruct3

Text Generation • Updated 10 days ago • 158 • 1

liked a model 15 days ago

m-a-p/MERT-v1-330M

Audio Classification • Updated May 7, 2024 • 42.8k • 61

upvoted an article 17 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • 18 days ago • 734

liked a model 21 days ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 21 days ago • 1.52M • 3.41k

upvoted a paper 21 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 23 days ago • 318

liked a model 25 days ago

deepseek-ai/DeepSeek-R1-Zero

Text Generation • Updated 5 days ago • 29.4k • 789

liked a model about 1 month ago

microsoft/phi-4

Text Generation • Updated 10 days ago • 601k • 1.72k

upvoted a paper about 1 month ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published Jan 9 • 93

liked 3 models about 1 month ago

upvoted 2 papers about 1 month ago

Dynamic Scaling of Unit Tests for Code Reward Modeling

Paper • 2501.01054 • Published Jan 2 • 17

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 48

upvoted 2 papers about 2 months ago

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 55

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 97