Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Skbaai 's Collections
Application
Education
Extra
VDO
Audio
Code
Upscale
Prompt
Chat
NSFW
Lange

Audio

updated Mar 2
Upvote
-

  • Running on Zero
    Agents
    Featured
    957

    MMAudio — generating synchronized audio from video/text

    🔊
    957

    Generate synchronized audio for videos or from text prompts


  • Configuration error
    Agents
    326

    TangoFlux

    🚀
    326

    Text to Audio (Sound SFX) Generator


  • Build error
    Agents
    Featured
    2.37k

    Bark

    🐶
    2.37k

    Generate realistic audio from text


  • Paused
    Agents
    Featured
    202

    YuE

    👩
    202

    Generate music from lyrics and genre tags


  • Running on Zero
    MCP
    Featured
    570

    Image to Music v2

    🎺
    570

    Get a music sample inspired by the mood of an image


  • Running
    MCP
    164

    Image To Flux Prompt

    📉
    164

    Generate a detailed description for any photo


  • Running
    311

    Ebook2audiobook v26.5.10

    🚀
    311

    Turn any ebook into audiobook, 1107+ languages supported!


  • Running
    Featured
    1.28k

    Whisper Web

    🎤
    1.28k

    Transcribe audio files to text instantly


  • Running
    Agents
    Featured
    1.75k

    Realistic Text To Speech Unlimited

    🔥
    1.75k

    Free Text-To-Speech generator with Emotion control (OpenAI)


  • Running on T4
    Agents
    Featured
    372

    UnlimitedMusicGen

    🎼
    372

    unlimited Audio generation with a few added features


  • Build error
    Agents
    9

    Higgs-Audio Enhanced

    🎤
    9

    converts text into natural-sounding speech using AI.

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs