Daily Papers

by AK and the research community

Jul 23

Submitted by

akhaliq

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

·
8 authors

Submitted by

srvm

Compact Language Models via Pruning and Knowledge Distillation

·
9 authors

Submitted by

Ningyu

Knowledge Mechanisms in Large Language Models: A Survey and Perspective

·
13 authors

Submitted by

akhaliq

NNsight and NDIF: Democratizing Access to Foundation Model Internals

·
20 authors

Submitted by

taesiri

VideoGameBunny: Towards vision assistants for video games

·
2 authors

Submitted by

grafft

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation

·
6 authors

Submitted by

teowu

LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding

·
4 authors

Submitted by

piergs

BOND: Aligning LLMs with Best-of-N Distillation

·
20 authors

Submitted by

yulunliu

BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes

·
6 authors

Submitted by

akhaliq

Artist: Aesthetically Controllable Text-Driven Stylization without Training

·
2 authors

Submitted by

feifeiobama

Discrete Flow Matching

·
8 authors

Submitted by

mmhamdy

Consent in Crisis: The Rapid Decline of the AI Data Commons

·
49 authors

Submitted by

akhaliq

HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions

·
5 authors

Submitted by

akhaliq

Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models

·
7 authors

Submitted by

xhyandwyy

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

·
11 authors

Submitted by

akhaliq

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

·
20 authors

Submitted by

Ori

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

·
6 authors

Submitted by

akhaliq

MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation

·
4 authors

Submitted by

liuhuohuo

CGB-DM: Content and Graphic Balance Layout Generation with Transformer-based Diffusion Model

·
5 authors

Submitted by

akhaliq

Local All-Pair Correspondence for Point Tracking

·
6 authors

Submitted by

akhaliq

ThermalNeRF: Thermal Radiance Fields

·
4 authors

Submitted by

akhaliq

Temporal Residual Jacobians For Rig-free Motion Transfer

·
7 authors

Submitted by

akhaliq

GET-Zero: Graph Embodiment Transformer for Zero-shot Embodiment Generalization

·
2 authors

Submitted by

davidchan

Visual Haystacks: Answering Harder Questions About Sets of Images

·
7 authors