- nvidia/OpenReasoning-Nemotron-1.5B
  Text Generation • 2B • Updated • 852 • 49
- nvidia/OpenReasoning-Nemotron-7B
  Text Generation • 8B • Updated • 614 • 47
- nvidia/OpenReasoning-Nemotron-14B
  Text Generation • 15B • Updated • 10k • 42
- nvidia/OpenReasoning-Nemotron-32B
  Text Generation • 33B • Updated • 3.18k • 118
Collections
Collections including paper arxiv:2504.16891
- OpenAI o1 System Card
  Paper • 2412.16720 • Published • 36
- LearnLM: Improving Gemini for Learning
  Paper • 2412.16429 • Published • 22
- NILE: Internal Consistency Alignment in Large Language Models
  Paper • 2412.16686 • Published • 8
- Offline Reinforcement Learning for LLM Multi-Step Reasoning
  Paper • 2412.16145 • Published • 38

- ChipNeMo: Domain-Adapted LLMs for Chip Design
  Paper • 2311.00176 • Published • 9
- Language Models can be Logical Solvers
  Paper • 2311.06158 • Published • 23
- JARVIS-1: Open-World Multi-task Agents with Memory-Augmented Multimodal Language Models
  Paper • 2311.05997 • Published • 37
- Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
  Paper • 2311.05657 • Published • 32