PersonaPlex: Voice and Role Control for Full Duplex Conversational Speech Models Paper • 2602.06053 • Published Jan 14 • 9
Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music Paper • 2604.10905 • Published 30 days ago • 28
Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models Paper • 2507.08128 • Published Jul 10, 2025 • 15
Negative-Guided Subject Fidelity Optimization for Zero-Shot Subject-Driven Generation Paper • 2506.03621 • Published Jun 4, 2025 • 22
Cosmos Collection ⚠️ This collection is archived. 👉 https://huggingface.co/collections/nvidia/nvidia-cosmos-2 • 14 items • Updated 4 days ago • 301
PHI-S: Distribution Balancing for Label-Free Multi-Teacher Distillation Paper • 2410.01680 • Published Oct 2, 2024 • 34