-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 48 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28
Collections
Discover the best community collections!
Collections including paper arxiv:2409.12186
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 291k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 674k • • 570 -
Qwen/Qwen2.5-Coder-32B-Instruct-GGUF
Text Generation • 33B • Updated • 16.1k • 177 -
dphn/dolphin-2.9.2-qwen2-72b
Text Generation • 73B • Updated • 787 • 170
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 35 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34
-
Qwen2.5 Technical Report
Paper • 2412.15115 • Published • 376 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 152 -
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
Paper • 2409.12122 • Published • 4 -
Qwen2.5-VL Technical Report
Paper • 2502.13923 • Published • 211
-
Phi-4 Technical Report
Paper • 2412.08905 • Published • 122 -
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 50 -
Evaluating Language Models as Synthetic Data Generators
Paper • 2412.03679 • Published • 48 -
Yi-Lightning Technical Report
Paper • 2412.01253 • Published • 28
-
Qwen/Qwen2.5-Coder-32B-Instruct
Text Generation • 33B • Updated • 291k • • 1.96k -
Qwen/Qwen2.5-Coder-7B-Instruct
Text Generation • 8B • Updated • 674k • • 570 -
Qwen/Qwen2.5-Coder-32B-Instruct-GGUF
Text Generation • 33B • Updated • 16.1k • 177 -
dphn/dolphin-2.9.2-qwen2-72b
Text Generation • 73B • Updated • 787 • 170
-
LLMs + Persona-Plug = Personalized LLMs
Paper • 2409.11901 • Published • 35 -
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning
Paper • 2409.12183 • Published • 39 -
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems
Paper • 2402.12875 • Published • 13 -
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
Paper • 2410.00531 • Published • 34
-
Qwen2.5 Technical Report
Paper • 2412.15115 • Published • 376 -
Qwen2.5-Coder Technical Report
Paper • 2409.12186 • Published • 152 -
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement
Paper • 2409.12122 • Published • 4 -
Qwen2.5-VL Technical Report
Paper • 2502.13923 • Published • 211