Collections

Collections including paper arxiv:2409.11340

Prompt Expansion

IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation

Paper • 2409.08240 • Published Sep 12, 2024 • 20
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Paper • 2410.07171 • Published Oct 9, 2024 • 42
EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models

Paper • 2410.07133 • Published Oct 9, 2024 • 19
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

Papers - Image - CoT

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110
LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 113

FLUX.1 Dev Inpainting Model Beta GPU

Space • Running on Zero • 156
Replace parts of an image using text prompts
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

📑Trending Papers - September 9⃣️

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published Sep 18, 2024 • 140
Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5, 2024 • 89
Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency

Paper • 2409.02634 • Published Sep 4, 2024 • 93
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

Omni-Generation

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110
Video-Guided Foley Sound Generation with Multimodal Controls

Paper • 2411.17698 • Published Nov 26, 2024 • 8
FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Paper • 2412.01064 • Published Dec 2, 2024 • 26
OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Paper • 2412.01169 • Published Dec 2, 2024 • 12

Interesting papers

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110
NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 73

Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 26
OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

Diffusion-Papers

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110

image generation

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 110
