Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 23 days ago • 53
bluepen5805/DeepSeek-R1-Distill-Qwen-14B-Japanese-gguf Text Generation • Updated 10 days ago • 17.1k • 31
TinySwallow Collection Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 8 days ago • 12
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Paper • 2501.11858 • Published 17 days ago • 5
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 16 days ago • 48