CAME: Confidence-guided Adaptive Memory Efficient Optimization Paper • 2307.02047 • Published Jul 5, 2023 • 2
MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration Paper • 2602.01734 • Published 9 days ago • 32