Submitted by osanseviero 53 Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming · 2 authors 6
Submitted by twinsken 37 VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters · 6 authors 2