PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding Paper • 2501.16411 • Published 10 days ago • 17
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Paper • 2501.11858 • Published 17 days ago • 5
Efficient-Large-Model/Sana_1600M_512px_MultiLing_diffusers Text-to-Image • Updated 27 days ago • 3
Efficient-Large-Model/Sana_1600M_1024px_diffusers Text-to-Image • Updated 27 days ago • 150 • 15
Efficient-Large-Model/Sana_1600M_1024px_MultiLing_diffusers Text-to-Image • Updated 27 days ago • 23 • 2