Lu Yu

VoladorLuYu

AI & ML interests

Neuro-Symbolic, Large Language Models, Graph Machine Learning

Organizations

None yet

upvoted a paper 5 months ago

SWE-EVO: Benchmarking Coding Agents in Long-Horizon Software Evolution Scenarios

Paper • 2512.18470 • Published Dec 20, 2025 • 12

upvoted a paper 7 months ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22, 2025 • 117

upvoted an article 10 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

Jul 8, 2025 • 775

upvoted 2 collections almost 2 years ago

Synthetic Data and Self-Improvement

Collection

113 items • Updated Sep 26, 2025 • 9

Foundation Models

Collection

76 items • Updated Nov 21, 2025 • 1

upvoted a collection about 2 years ago

Diffusion model Spaces

Collection

309 items • Updated Mar 2 • 35

upvoted 3 papers about 2 years ago

Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance

Paper • 2403.16952 • Published Mar 25, 2024 • 1

Large Language Models Struggle to Learn Long-Tail Knowledge

Paper • 2211.08411 • Published Nov 15, 2022 • 3

In-Context Learning Creates Task Vectors

Paper • 2310.15916 • Published Oct 24, 2023 • 43

upvoted 2 collections about 2 years ago

Coding

Collection

195 items • Updated Jan 24 • 23

Diffusion Model

Collection

49 items • Updated Aug 19, 2024 • 9

upvoted a paper about 2 years ago

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Paper • 2402.01680 • Published Jan 21, 2024 • 2

upvoted a collection about 2 years ago

Reasoning | Planning

Collection

27 items • Updated Dec 22, 2024 • 6

upvoted a paper over 2 years ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 83

upvoted a collection over 2 years ago

RL/Alignment

Collection

202 items • Updated Jan 15 • 27

upvoted 3 papers over 2 years ago

How Does Generative Retrieval Scale to Millions of Passages?

Paper • 2305.11841 • Published May 19, 2023 • 4

Can LLMs Follow Simple Rules?

Paper • 2311.04235 • Published Nov 6, 2023 • 13

Enhancing Zero-Shot Chain-of-Thought Reasoning in Large Language Models through Logic

Paper • 2309.13339 • Published Sep 23, 2023 • 3
