Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted a paper 5 days ago
Generative Refinement Networks for Visual Synthesis upvoted a paper 19 days ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention authored a paper 21 days ago
LIFT: Improving Long Context Understanding of Large Language Models
through Long Input Fine-TuningOrganizations
None yet