Malkesh Dalia's picture

7 7

Malkesh Dalia

malkesh2911

·

AI & ML interests

None yet

Recent Activity

updated a collection 11 days ago

upvoted a paper 11 days ago

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

liked a model 18 days ago

google/gemma-7b

View all activity

Organizations

None yet

upvoted a paper 11 days ago

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Paper • 2511.13288 • Published 20 days ago • 17

upvoted 2 papers 18 days ago

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 44

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published 22 days ago • 12

upvoted a paper 27 days ago

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 8

upvoted 2 papers about 2 months ago

RAG-Anything: All-in-One RAG Framework

Paper • 2510.12323 • Published Oct 14 • 49

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 491

upvoted a paper 3 months ago

FlowRL: Matching Reward Distributions for LLM Reasoning

Paper • 2509.15207 • Published Sep 18 • 114