Great paper
Differential Transformer Paper • 2410.05258 • Published Oct 7, 2024 • 182
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024 • 135
VisionZip: Longer is Better but Not Necessary in Vision Language Models Paper • 2412.04467 • Published Dec 5, 2024 • 118
o1-Coder: an o1 Replication for Coding Paper • 2412.00154 • Published Nov 29, 2024 • 44

LLM Training
Tina: Tiny Reasoning Models via LoRA Paper • 2504.15777 • Published Apr 22, 2025 • 57
LLM360 K2: Building a 65B 360-Open-Source Large Language Model from Scratch Paper • 2501.07124 • Published Jan 13, 2025
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published Mar 26, 2025 • 59

Animation
Animate-X: Universal Character Image Animation with Enhanced Motion Representation Paper • 2410.10306 • Published Oct 14, 2024 • 57

Object detection
Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper • 2405.10300 • Published May 16, 2024 • 30