Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Privileged On-Policy Exploration

Team
classroom
Activity Feed

AI & ML interests

None defined yet.

Yuxiao Qu's profile picture

models 48

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4096_40_with_reasoning_mbz_1021

8B • Updated Oct 23 • 18

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4096_40_without_reasoning_mbz_1021

8B • Updated Oct 23 • 3

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-1024_160_with_reasoning_mbz_1021

8B • Updated Oct 23 • 6

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-1024_160_without_reasoning_mbz_1021

8B • Updated Oct 23 • 3

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-256_640_with_reasoning_mbz_1021

8B • Updated Oct 22 • 5

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-256_640_without_reasoning_mbz_1021

8B • Updated Oct 22 • 5

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-64_2560_with_reasoning_mbz_1021

8B • Updated Oct 21 • 4

CMU-POPE/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-64_2560_without_reasoning_mbz_1021

8B • Updated Oct 21 • 4

CMU-POPE/Instruct-POPE-hard-no_guide

4B • Updated Oct 11 • 4

CMU-POPE/Instruct-HARD-ALL-gemini_first_no-guide

4B • Updated Oct 6 • 4
View 48 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs