Daily Papers

by AK and the research community

Dec 30

Submitted by

akhaliq

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

·
8 authors

Submitted by

akhaliq

1.58-bit FLUX

·
7 authors

Submitted by

akhaliq

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

·
27 authors

Submitted by

ZehanWang

Orient Anything: Learning Robust Object Orientation Estimation from Rendering 3D Models

·
6 authors

Submitted by

ynhe

Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment

·
12 authors

Submitted by

KyleLin

From Elements to Design: A Layered Approach for Automatic Graphic Design Composition

·
6 authors

Submitted by

BestWishYsh

VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Models

·
9 authors

Submitted by

mskrt

The Superposition of Diffusion Models Using the Itô Density Estimator

·
5 authors

Submitted by

jacksukk

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

·
6 authors

Submitted by

yanlinf

CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era

·
3 authors

Submitted by

risashinoda

SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images

·
5 authors