Collections

Collections including paper arxiv:2502.04896

zai-org/CogVideoX1.5-5B-I2V

Image-to-Video • Updated Mar 18, 2025 • 1.2k • 113
tencent/HunyuanVideo-PromptRewrite

Updated Dec 6, 2024 • 155 • 52
moondream/moondream2-gguf

1B • Updated Apr 25, 2024 • 1.16k • 28
czyang/MultiFoley-VGGSound-Test-Audio

Viewer • Updated Feb 5, 2025 • 34.5k • 40 • 3

Big5Personality

Personality Traits in Large Language Models

Paper • 2307.00184 • Published Jul 1, 2023 • 20
Open-Sora: Democratizing Efficient Video Production for All

Paper • 2412.20404 • Published Dec 29, 2024 • 1
Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 106
SUGAR: Subject-Driven Video Customization in a Zero-Shot Manner

Paper • 2412.10533 • Published Dec 13, 2024 • 5

Image Face Upscale Restoration-GFPGAN 📈

Runtime error • 687 • Enhance and upscale images with face restoration
FLUX.1 [dev] 🖥

Running on Zero • Featured • 9.35k • Generate images from text descriptions
Stable Diffusion 3.5 Medium 🏃

Running on Zero • 167 • Generate images with SD3.5
IDM VTON 👕

Runtime error • Featured • 2.04k • High-fidelity Virtual Try-on

FLUX.1 [dev] 🖥

Running on Zero • Featured • 9.35k • Generate images from text descriptions
Live Portrait 🤪

Running on Zero • 3.68k • Apply the motion of a video on a portrait
Kolors Virtual Try-On 👕

Running on CPU Upgrade • 9.95k • Try on clothes on a person image
PuLID ⚡

Running on Zero • Featured • 538 • Generate images with custom ID and style

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Paper • 2401.15977 • Published Jan 29, 2024 • 39
Lumiere: A Space-Time Diffusion Model for Video Generation

Paper • 2401.12945 • Published Jan 23, 2024 • 86
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning

Paper • 2307.04725 • Published Jul 10, 2023 • 64
Boximator: Generating Rich and Controllable Motions for Video Synthesis

Paper • 2402.01566 • Published Feb 2, 2024 • 27

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

Paper • 2411.02337 • Published Nov 4, 2024 • 36
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 50
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published Nov 5, 2024 • 69
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization

Paper • 2410.08815 • Published Oct 11, 2024 • 47

Animate-X: Universal Character Image Animation with Enhanced Motion Representation

Paper • 2410.10306 • Published Oct 14, 2024 • 56
Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 106
RunDiffusion/Juggernaut-XL-v9

Text-to-Image • Updated Dec 11, 2024 • 114k • 281

MaskBit: Embedding-free Image Generation via Bit Tokens

Paper • 2409.16211 • Published Sep 24, 2024 • 17
Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7, 2025 • 106
Discrete Audio Tokens: More Than a Survey!

Paper • 2506.10274 • Published Jun 12, 2025 • 32
HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

Paper • 2506.20452 • Published Jun 25, 2025 • 19

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 23
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models

Paper • 2402.08714 • Published Feb 13, 2024 • 15
Data Engineering for Scaling Language Models to 128K Context

Paper • 2402.10171 • Published Feb 15, 2024 • 25
RLVF: Learning from Verbal Feedback without Overgeneralization

Paper • 2402.10893 • Published Feb 16, 2024 • 12
Coercing LLMs to do and reveal (almost) anything

Paper • 2402.14020 • Published Feb 21, 2024 • 13
