BytedanceDouyinContent/SAILViT-Huge-600M-448px Image Feature Extraction • 0.7B • Updated Jul 3 • 48 • 3
BytedanceDouyinContent/SAILViT-Large-300M-448px Image Feature Extraction • 0.3B • Updated Jul 3 • 274 • 2
SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model Paper • 2510.12709 • Published Oct 14 • 12
Scalable Vision Language Model Training via High Quality Data Curation Paper • 2501.05952 • Published Jan 10 • 5
SAILViT: Towards Robust and Generalizable Visual Backbones for MLLMs via Gradual Feature Refinement Paper • 2507.01643 • Published Jul 2 • 2
MEML-GRPO: Heterogeneous Multi-Expert Mutual Learning for RLVR Advancement Paper • 2508.09670 • Published Aug 13