Customizing Text-to-Image Models with a Single Image Pair Paper • 2405.01536 • Published May 2, 2024 • 20
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Paper • 2404.03913 • Published Apr 5, 2024
LCM-Lookahead for Encoder-based Text-to-Image Personalization Paper • 2404.03620 • Published Apr 4, 2024 • 1
Customizing Text-to-Image Diffusion with Camera Viewpoint Control Paper • 2404.12333 • Published Apr 18, 2024 • 1
jtatman/stable-diffusion-prompts-stats-full-uncensored • Updated Nov 8, 2024 • 897k • 179 • 60
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28, 2024 • 86
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs Paper • 2408.11813 • Published Aug 21, 2024 • 12
TokenPacker: Efficient Visual Projector for Multimodal LLM Paper • 2407.02392 • Published Jul 2, 2024 • 21
PALP: Prompt Aligned Personalization of Text-to-Image Models Paper • 2401.06105 • Published Jan 11, 2024 • 49
Training-Free Consistent Text-to-Image Generation Paper • 2402.03286 • Published Feb 5, 2024 • 66
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Paper • 2408.03703 • Published Aug 7, 2024
AutoPresent: Designing Structured Visuals from Scratch Paper • 2501.00912 • Published Jan 1, 2025 • 8
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published about 1 month ago • 48
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Paper • 2501.05452 • Published 28 days ago • 15
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 23 days ago • 18
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 21 days ago • 13