YYY
zzfive
AI & ML interests
None yet
Recent Activity
Updated a collection about 10 hours ago: datasets
Updated a collection about 10 hours ago: 3d
Updated a collection about 10 hours ago: agent
Organizations
None yet
Collections
16
- RL + Transformer = A General-Purpose Problem Solver
  Paper • 2501.14176 • Published • 22
- Towards General-Purpose Model-Free Reinforcement Learning
  Paper • 2501.16142 • Published • 24
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 100
- MaxInfoRL: Boosting exploration in reinforcement learning through information gain maximization
  Paper • 2412.12098 • Published • 4
Models
9
zzfive/ComfyChat-InternLM2.5-7b-v2-2 • Updated • 1
zzfive/ComfyChat-Qwen2-7b-instruct-v2-2 • Updated • 2
zzfive/ComfyChat-Llama3-8b-instruct-v2-2 • Updated • 7 • 1
zzfive/ComfyChat-InternLM2-7b-v2-1 • Feature Extraction • Updated • 4
zzfive/ComfyChat-InternLM2.5-7b-v2-1 • Updated • 5
zzfive/ComfyChat-InternLM2-1-8b-v2-1 • Feature Extraction • Updated • 103
zzfive/ComfyChat-InternLM2-7b-v1 • Feature Extraction • Updated • 2
zzfive/ComfyChat-InternLM2-1-8b-v1 • Feature Extraction • Updated • 104
zzfive/DAIGT-InternLM-small • Feature Extraction • Updated • 4