1195 1194

Starstrek

Stars321123

Stars321

AI & ML interests

Recent Activity

upvoted a paper about 11 hours ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

liked a model about 11 hours ago

Beckham808/LightGen

upvoted a paper about 11 hours ago

4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer

View all activity

Organizations

upvoted a paper about 11 hours ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 2 days ago • 135

liked a model about 11 hours ago

Beckham808/LightGen

Text-to-Image • Updated Mar 13 • 7

upvoted a paper about 11 hours ago

4DLangVGGT: 4D Language-Visual Geometry Grounded Transformer

Paper • 2512.05060 • Published 2 days ago • 17

liked a model about 11 hours ago

microsoft/VibeVoice-Realtime-0.5B

Text-to-Speech • 1B • Updated 2 days ago • 20.1k • 355

upvoted a paper about 15 hours ago

What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards

Paper • 2512.00425 • Published 7 days ago • 45

liked a model about 15 hours ago

easygoing0114/Z-Image_clear_vae

Updated about 2 hours ago • 11

upvoted 2 papers about 15 hours ago

TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models

Paper • 2512.02014 • Published 5 days ago • 48

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published 4 days ago • 25

upvoted 2 collections about 16 hours ago

Self-Calibration

Collection

Efficient Test-Time Scaling via Self-Calibration https://arxiv.org/abs/2503.00031 • 7 items • Updated Jun 8 • 3

PosS-Speculative-Decoding

Collection

This collection contains models of the paper "PosS:Position Specialist Generates Better Draft for Speculative Decoding" • 9 items • Updated Jun 5 • 2

upvoted 2 papers about 17 hours ago

Guided Self-Evolving LLMs with Minimal Human Supervision

Paper • 2512.02472 • Published 5 days ago • 47

Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion

Paper • 2512.04926 • Published 2 days ago • 28

upvoted a paper 2 days ago

PixelDiT: Pixel Diffusion Transformers for Image Generation

Paper • 2511.20645 • Published 11 days ago • 24

upvoted 4 articles 2 days ago

Article

Swift Transformers Reaches 1.0 – and Looks to the Future

Sep 26

•

Article

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

17 days ago

•

Article

We Got Claude to Fine-Tune an Open Source LLM

3 days ago

•

221

Article

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

3 days ago

•

liked a Space 3 days ago

Huggingface Leaderboard

🏆

101

Generate Hugging Face leaderboard stats

liked 2 models 3 days ago

nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14 • 55.4k • 775

Qwen/Qwen2.5-Math-72B

Text Generation • 73B • Updated Sep 23, 2024 • 1.37k • 17

Starstrek

AI & ML interests

Recent Activity

Organizations

Stars321123's activity

Swift Transformers Reaches 1.0 – and Looks to the Future

Introducing AnyLanguageModel: One API for Local and Remote LLMs on Apple Platforms

We Got Claude to Fine-Tune an Open Source LLM

Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand

Huggingface Leaderboard