superpeng (peng)

upvoted a paper 4 months ago

Fast Segment Anything

Paper • 2306.12156 • Published Jun 21, 2023 • 35

upvoted 2 collections 4 months ago

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 6 days ago • 106

Papers to Read

Collection

208 items • Updated Aug 24, 2025 • 10

upvoted a paper 4 months ago

JudgeLRM: Large Reasoning Models as a Judge

Paper • 2504.00050 • Published Mar 31, 2025 • 62

upvoted a collection 4 months ago

2025 LLM Papers on Hugging Face with Japanese Memos

Collection

78 items • Updated Apr 29, 2025 • 2

upvoted 2 papers 4 months ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published Jan 14, 2025 • 62

A Survey on Post-training of Large Language Models

Paper • 2503.06072 • Published Mar 8, 2025 • 10

upvoted a paper 5 months ago

NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model

Paper • 2508.14444 • Published Aug 20, 2025 • 40

upvoted 2 collections 6 months ago

II-Medical

Collection

9 items • Updated Jul 4, 2025 • 15

Medical QA Datasets

Collection

A collection of medical question answering (QA) datasets • 23 items • Updated Feb 22, 2025 • 47

upvoted a paper 7 months ago

QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO Training

Paper • 2506.00711 • Published May 31, 2025 • 1

upvoted a paper 11 months ago

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25, 2025 • 75

upvoted 2 collections 11 months ago

Phi-4

Collection

Phi-4 family of small language, multi-modal and reasoning models. • 17 items • Updated Jul 10, 2025 • 192

DeepSeek-R1-ReDistill

Collection

Re-distilled DeepSeek R1 models • 4 items • Updated Jan 30, 2025 • 15

upvoted a paper about 1 year ago

DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought

Paper • 2412.17498 • Published Dec 23, 2024 • 22

upvoted an article about 1 year ago

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

Article • Aug 19, 2024 • 79

upvoted a collection about 1 year ago

Skywork-Reward-Data-Collection

Collection

Open-source preference datasets used to train the Skywork reward model series • 17 items • Updated Oct 12, 2024 • 21

upvoted 2 papers over 1 year ago

HelpSteer2: Open-source dataset for training top-performing reward models

Paper • 2406.08673 • Published Jun 12, 2024 • 19

Xwin-LM: Strong and Scalable Alignment Practice for LLMs

Paper • 2405.20335 • Published May 30, 2024 • 17

upvoted a collection over 1 year ago

Biomedical NLP papers

Collection

Papers posted on @[email protected] (Clinical, Healthcare & Biomedical NLP) • 183 items • Updated Jan 24, 2025 • 43

peng

AI & ML interests

Organizations

superpeng's activity
