LLMs
updated
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
Paper
•
2508.06471
•
Published
•
203
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable
Reinforcement Learning
Paper
•
2507.01006
•
Published
•
250
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality,
Long Context, and Next Generation Agentic Capabilities
Paper
•
2507.06261
•
Published
•
66
SmallThinker: A Family of Efficient Large Language Models Natively
Trained for Local Deployment
Paper
•
2507.20984
•
Published
•
58
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning
Attention
Paper
•
2506.13585
•
Published
•
273
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient
Robotics
Paper
•
2506.01844
•
Published
•
148
Qwen3 Embedding: Advancing Text Embedding and Reranking Through
Foundation Models
Paper
•
2506.05176
•
Published
•
77
A Survey of Reinforcement Learning for Large Reasoning Models
Paper
•
2509.08827
•
Published
•
190
Qwen3-Omni Technical Report
Paper
•
2509.17765
•
Published
•
146
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
Paper
•
2509.02547
•
Published
•
229