🎯 Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 6 days ago • 106
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published Jan 14, 2025 • 62
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model Paper • 2508.14444 • Published Aug 20, 2025 • 40
Medical QA Datasets Collection A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 47
QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training Paper • 2506.00711 • Published May 31, 2025 • 1
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published Feb 25, 2025 • 75
Phi-4 Collection Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 192
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30, 2025 • 15
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper • 2412.17498 • Published Dec 23, 2024 • 22
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging Aug 19, 2024 • 79
Skywork-Reward-Data-Collection Collection Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 21
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12, 2024 • 19
Xwin-LM: Strong and Scalable Alignment Practice for LLMs Paper • 2405.20335 • Published May 30, 2024 • 17
Biomedical NLP papers Collection Papers posted on @[email protected] (Clinical, Healthcare & Biomedical NLP) • 183 items • Updated Jan 24, 2025 • 43