ICML2023
AI & ML interests
None defined yet.
Recent Activity
View all activity
ICML2023's activity
ameerazam08
posted
an
update
7 days ago
Post
1595
R1 is out! And with a lot of other R1 releated models...
hysts
updated
a
Space
26 days ago
vwxyzjn
authored
5
papers
about 1 month ago
The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization
Paper
•
2403.17031
•
Published
•
6
A2C is a special case of PPO
Paper
•
2205.09123
•
Published
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
Paper
•
2410.18252
•
Published
•
5
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training
Paper
•
2411.15124
•
Published
•
59
2 OLMo 2 Furious
Paper
•
2501.00656
•
Published
•
16
mbrack
authored
a
paper
about 2 months ago
Post
8287
Google drops Gemini 2.0 Flash Thinking
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
a new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and more
now available in anychat, try it out: akhaliq/anychat
Kameshr
authored
a
paper
about 2 months ago
Post
9248
QwQ-32B-Preview is now available in anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
A reasoning model that is competitive with OpenAI o1-mini and o1-preview
try it out: akhaliq/anychat
Post
3943
Post
2900
anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
supports chatgpt, gemini, perplexity, claude, meta llama, grok all in one app
try it out there: akhaliq/anychat
xzyao
authored
a
paper
3 months ago
Lupin1998
authored
2
papers
4 months ago