zwhy's picture

2 2

zwhy

XiaohuaWang

·

AI & ML interests

None yet

Recent Activity

authored a paper about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

upvoted a paper about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

updated a model 4 months ago

XiaohuaWang/math-interactive-rl

View all activity

Organizations

authored a paper about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

upvoted a paper about 1 month ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published Apr 15 • 32

updated a model 4 months ago

XiaohuaWang/math-interactive-rl

published a model 4 months ago

XiaohuaWang/math-interactive-rl

upvoted a paper 4 months ago

Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction

Paper • 2601.05107 • Published Jan 8 • 24

liked a Space 11 months ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked a dataset over 1 year ago

allenai/WildChat-1M

Viewer • Updated Oct 17, 2024 • 838k • 13.8k • 436

updated a dataset almost 2 years ago

FudanDNN-NLP/Wiki_Med_DB

Updated Jul 2, 2024 • 11

updated a model almost 2 years ago

FudanDNN-NLP/llama3-8b-instruct-ragga-disturb

Text Generation • 8B • Updated Jul 2, 2024 • 6 •