Xiao Hu

huxiao09

AI & ML interests

Reinforcement Learning, LLM Reasoning

Recent Activity

liked a model 17 days ago

Kwai-Keye/Keye-VL-671B-A37B

upvoted a paper 4 months ago

Thyme: Think Beyond Images

authored a paper 5 months ago

Query-Policy Misalignment in Preference-Based Reinforcement Learning

Organizations

None yet

Papers 5

arxiv:2507.01949

arxiv:2505.21067

arxiv:2505.02835

arxiv:2402.03046

Models 0

None public yet

Datasets 0

None public yet