new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Dec 10

Submitted by

chujiezheng

ProcessBench: Identifying Process Errors in Mathematical Reasoning

·
9 authors

Submitted by

Shibo-UCSD

Training Large Language Models to Reason in a Continuous Latent Space

·
7 authors

Submitted by

avanturist

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

·
5 authors

Submitted by

kkr5155

Maya: An Instruction Finetuned Multilingual Multimodal Model

·
19 authors

Submitted by

nicolas-dufour

Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation

·
4 authors

Submitted by

tttoaster

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

·
4 authors

Submitted by

LooperXX

Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models

·
6 authors

Submitted by

xinlongwang

You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale

·
7 authors

Submitted by

ahatamiz

Gated Delta Networks: Improving Mamba2 with Delta Rule

·
3 authors

Submitted by

Hidir

MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

·
4 authors

Submitted by

mikonvergence

Global and Dense Embeddings of Earth: Major TOM Floating in the Latent Space

·
3 authors

Submitted by

AntoineGuedon

MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views

·
4 authors

Submitted by

huangsiteng

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

·
8 authors

Submitted by

xiaojunxu

Robust Multi-bit Text Watermark with LLM-based Paraphrasers

·
5 authors

Submitted by

mkhalifa

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

·
9 authors

Submitted by

howard06

Turbo3D: Ultra-fast Text-to-3D Generation

·
9 authors