Teaching Language Models to Critique via Reinforcement Learning Paper • 2502.03492 • Published 10 days ago • 21
NatureLM: Deciphering the Language of Nature for Scientific Discovery Paper • 2502.07527 • Published 4 days ago • 14
MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents Paper • 2502.05957 • Published 5 days ago • 13
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 23 days ago • 318