thestage.ai

Team

company

https://thestage.ai

TheStageAI

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

quazim updated a model 3 days ago

TheStageAI/thewhisper-large-v3-turbo

hypothetical new activity 6 days ago

TheStageAI/README:Update README.md

psynote123 published a model 7 days ago

TheStageAI/Wan2.2-ComfyUI

View all activity

quazim

updated a model 3 days ago

TheStageAI/thewhisper-large-v3-turbo

Automatic Speech Recognition • 0.8B • Updated 3 days ago • 2.19k • 14

hypothetical

in TheStageAI/README 6 days ago

Update README.md

#2 opened 6 days ago by

hypothetical

posted an update 7 days ago

Post

2565

We thought it would be easier, but finally we have integrated CuDNN Paged Attention to our models!

Read article here: https://app.thestage.ai/blog/Integrating-cuDNN-Paged-Attention-to-TheStage-AI-Inference?id=8

Llama-8B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Llama-3.1-8B-Instruct
Mistral-Small-24B with CuDNN paged attention, including B200 support: TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503

psynote123

published a model 7 days ago

TheStageAI/Wan2.2-ComfyUI

Updated 7 days ago

psynote123

updated a model 7 days ago

TheStageAI/Wan2.2-ComfyUI

Updated 7 days ago

hypothetical

published a model 7 days ago

TheStageAI/Elastic-Wan2.2-T2V-A14B-Diffusers

Text-to-Video • Updated Dec 1, 2025 • 4 • 1

hinairo

updated 2 models 8 days ago

TheStageAI/Elastic-Mistral-Small-3.1-24B-Instruct-2503

Text Generation • Updated 8 days ago • 24 • 2

TheStageAI/Elastic-Llama-3.1-8B-Instruct

Text Generation • Updated 8 days ago • 40 • 3

hypothetical

in TheStageAI/thewhisper-large-v3-turbo 14 days ago

add languages, base model, update license

#3 opened 14 days ago by

hypothetical

posted an update 14 days ago

Post

2014

We have updated our transcription model: TheStageAI/thewhisper-large-v3-turbo

– 6.00 WER on the English Open ASR Leaderboard
– 4.74 WER on the Multilingual Open ASR Leaderboard
– Beats NVIDIA Parakeet (6.34 WER) and Whisper-large-v3-turbo (7.8 WER)
– Strong improvements in Arabic, Hindi, Chinese
– Maintains quality with background and environmental noise
– Optimized inference engines for NVIDIA and Apple
– Hugging Face Transformers interface for easy use
– Best-in-class speed on NVIDIA GPUs and power efficiency on Apple devices
– NVIDIA Jetson Thor support

2 replies

quazim

in TheStageAI/thewhisper-large-v3-turbo 14 days ago

update-checkpoint-v2

#2 opened 14 days ago by

quazim

hypothetical

updated a collection 15 days ago

Elastic Diffusers

Collection

HuggingFace Diffusers models accelerated by TheStage AI ANNA: Automated NNs Accelerator. • 6 items • Updated 15 days ago • 2

hypothetical

posted an update about 2 months ago

Post

266

Hello guys! Maybe someone want to test our framework for automated model's compression. Here is what can be produced with it. Move the slider - compress/accelerate model, select point which like and compile. I can give an access, we are now improving and collecting comments from users

TheStageAI/ANNA-LLM