Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RLVER
's Collections
RLVER
RLVER
updated
Jul 8, 2025
Checkpoints trained via RLVER, the first RLVR framework to boost LLM empathy.
Upvote
-
RLVER/PPO-non-thinking
8B
•
Updated
Jul 9, 2025
•
2
•
1
RLVER/GRPO-thinking
8B
•
Updated
Jul 9, 2025
•
5
RLVER/PPO-thinking
8B
•
Updated
Jul 9, 2025
•
5
RLVER/GRPO-non-thinking
8B
•
Updated
Jul 9, 2025
•
3
Upvote
-
Share collection
View history
Collection guide
Browse collections