Qihan Ren's picture

2 15 2

Qihan Ren

jasonrqh

·

https://nebularaid2000.github.io/

AI & ML interests

explainable AI, LLM

Recent Activity

upvoted a paper 8 days ago

Geometrically-Constrained Agent for Spatial Reasoning

upvoted a paper 13 days ago

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

upvoted a paper about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

View all activity

Organizations

upvoted a paper 8 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 12 days ago • 38

upvoted a paper 13 days ago

iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation

Paper • 2511.20635 • Published 14 days ago • 31

upvoted 3 papers about 2 months ago

Attention Illuminates LLM Reasoning: The Preplan-and-Anchor Rhythm Enables Fine-Grained Policy Optimization

Paper • 2510.13554 • Published Oct 15 • 57

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10 • 51

Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models

Paper • 2509.23962 • Published Sep 28 • 5

upvoted 7 papers 2 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9 • 22

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

Paper • 2510.08529 • Published Oct 9 • 18

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29 • 29

Socratic-Zero : Bootstrapping Reasoning via Data-Free Agent Co-evolution

Paper • 2509.24726 • Published Sep 29 • 19

Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents

Paper • 2509.26354 • Published Sep 30 • 17

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2 • 80

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 8

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

upvoted a paper 6 months ago

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26 • 8

upvoted a paper 8 months ago

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88