SE-Bench: Benchmarking Self-Evolution with Knowledge Internalization Paper β’ 2602.04811 β’ Published Feb 4 β’ 2
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper β’ 2601.14253 β’ Published Jan 20 β’ 10
V-DPM: 4D Video Reconstruction with Dynamic Point Maps Paper β’ 2601.09499 β’ Published Jan 14 β’ 9
UM-Text: A Unified Multimodal Model for Image Understanding Paper β’ 2601.08321 β’ Published Jan 13 β’ 11
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper β’ 2601.03955 β’ Published Jan 7 β’ 3
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper β’ 2512.24724 β’ Published Dec 31, 2025 β’ 8
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper β’ 2512.24766 β’ Published Dec 31, 2025 β’ 9
What matters for Representation Alignment: Global Information or Spatial Structure? Paper β’ 2512.10794 β’ Published Dec 11, 2025 β’ 9
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper β’ 2512.07843 β’ Published Nov 24, 2025 β’ 22
view post Post 21527 Want to iterate on a Hugging Face Space with an LLM? Now you can easily convert any HF entire repo (Model, Dataset or Space) to a text file and feed it to a language model! multimodalart/repo2txt See translation 1 reply Β· π€ 3 3 π 2 2 π 1 1 + Reply
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published Oct 9, 2025 β’ 39
view post Post 18237 Self-Forcing - a real-time video distilled model from Wan 2.1 by @adobe is out, and they open sourced it πI've built a live real time demo on Spaces πΉπ¨ multimodalart/self-forcing See translation 6 replies Β· β€οΈ 12 12 π₯ 6 6 + Reply
view post Post 52788 Google drops Gemini 2.0 Flash Thinkinga new experimental model that unlocks stronger reasoning capabilities and shows its thoughts. The model plans (with thoughts visible), can solve complex problems with Flash speeds, and morenow available in anychat, try it out: https://huggingface.co/spaces/akhaliq/anychat See translation 5 replies Β· π 12 12 π₯ 6 6 π 4 4 π 2 2 + Reply
view post Post 51806 QwQ-32B-Preview is now available in anychatA reasoning model that is competitive with OpenAI o1-mini and o1-previewtry it out: https://huggingface.co/spaces/akhaliq/anychat See translation 2 replies Β· β€οΈ 3 3 π 2 2 + Reply
view post Post 5105 New model drop in anychatallenai/Llama-3.1-Tulu-3-8B is now availabletry it here: https://huggingface.co/spaces/akhaliq/anychat See translation π₯ 3 3 π 1 1 + Reply
view post Post 3850 anychatsupports chatgpt, gemini, perplexity, claude, meta llama, grok all in one apptry it out there: https://huggingface.co/spaces/akhaliq/anychat β€οΈ 7 7 π 4 4 π₯ 2 2 + Reply
view post Post 35605 New feature π₯ Image models and LoRAs now have little previews π€If you don't know where to start to find them, I invite you to browse cool LoRAs in the profile of some amazing fine-tuners: @artificialguybr , @alvdansen , @DoctorDiffusion , @e-n-v-y , @KappaNeuro @ostris 3 replies Β· β€οΈ 13 13 π 1 1 π€ 1 1 + Reply