University of Southern California

university

Verified

https://www.usc.edu

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

farukakgul submitted a paper 9 days ago

Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning

Bill1235813 submitted a paper 29 days ago

Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?

Quankai submitted a paper about 2 months ago

LOME: Learning Human-Object Manipulation with Action-Conditioned Egocentric World Model

View all activity

Papers

Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning

Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?

View all Papers

submitted a paper to Daily Papers 9 days ago

Rethinking RL for LLM Reasoning: It's Sparse Policy Selection, Not Capability Learning

Paper • 2605.06241 • Published 13 days ago • 4

submitted a paper to Daily Papers 29 days ago

Precise Debugging Benchmark: Is Your Model Debugging or Regenerating?

Paper • 2604.17338 • Published Apr 19 • 4

submitted a paper to Daily Papers about 2 months ago

LOME: Learning Human-Object Manipulation with Action-Conditioned Egocentric World Model

Paper • 2603.27449 • Published Mar 28 • 6

in USC/README 3 months ago

Who runs this page?

#1 opened 3 months ago by

in USC/README 3 months ago

Who runs this page?

#1 opened 3 months ago by

in USC/README 3 months ago

Who runs this page?

#1 opened 3 months ago by

published a Space 3 months ago

README

in USC/README 3 months ago

Who runs this page?

#1 opened 3 months ago by

submitted a paper to Daily Papers 4 months ago

Are LLM Decisions Faithful to Verbal Confidence?

Paper • 2601.07767 • Published Jan 12 • 5

submitted a paper to Daily Papers 5 months ago

LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning

Paper • 2512.05325 • Published Dec 5, 2025 • 5

authored a paper 7 months ago

Flip-Flop Consistency: Unsupervised Training for Robustness to Prompt Perturbations in LLMs

Paper • 2510.14242 • Published Oct 16, 2025

authored a paper 10 months ago

Compositional Coordination for Multi-Robot Teams with Large Language Models

Paper • 2507.16068 • Published Jul 21, 2025

authored a paper 12 months ago

Localized Gaussian Splatting Editing with Contextual Awareness

Paper • 2408.00083 • Published Jul 31, 2024

authored a paper over 1 year ago

DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 66

authored 6 papers about 2 years ago

From Words to Routes: Applying Large Language Models to Vehicle Routing

Paper • 2403.10795 • Published Mar 16, 2024

HyperPPO: A scalable method for finding small policies for robotic control

Paper • 2309.16663 • Published Sep 28, 2023

Decentralized Control of Quadrotor Swarms with End-to-end Deep Reinforcement Learning

Paper • 2109.07735 • Published Sep 16, 2021 • 1

Sample Factory: Egocentric 3D Control from Pixels at 100000 FPS with Asynchronous Reinforcement Learning

Paper • 2006.11751 • Published Jun 21, 2020

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

Paper • 2306.09537 • Published Jun 15, 2023

Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning

Paper • 2309.13285 • Published Sep 23, 2023