Xiaoya Lu

Ursulalala · ursulalujun

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

Paper • 2512.02425 • Published 7 days ago • 22

upvoted a paper 7 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 11 days ago • 38

commented on a paper 7 days ago

Geometrically-Constrained Agent for Spatial Reasoning

Paper • 2511.22659 • Published 11 days ago • 38

upvoted 4 papers about 2 months ago

Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning

Paper • 2510.11027 • Published Oct 13 • 21

High-Fidelity Simulated Data Generation for Real-World Zero-Shot Robotic Manipulation Learning with Gaussian Splatting

Paper • 2510.10637 • Published Oct 12 • 12

BEAR: Benchmarking and Enhancing Multimodal Language Models for Atomic Embodied Capabilities

Paper • 2510.08759 • Published Oct 9 • 46

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9 • 22

upvoted 2 papers 2 months ago

Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step

Paper • 2509.23924 • Published Sep 28 • 8

How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective

Paper • 2509.18905 • Published Sep 23 • 29

upvoted 3 papers 4 months ago

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 97

Frontier AI Risk Management Framework in Practice: A Risk Analysis Technical Report

Paper • 2507.16534 • Published Jul 22 • 7

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15 • 64

upvoted 3 papers 5 months ago

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Paper • 2506.16402 • Published Jun 19 • 1

X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability

Paper • 2502.09990 • Published Feb 14 • 1

Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues

Paper • 2410.10700 • Published Oct 14, 2024 • 3

authored 3 papers 5 months ago

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

Paper • 2506.16402 • Published Jun 19 • 1

Derail Yourself: Multi-turn LLM Jailbreak Attack through Self-discovered Clues

Paper • 2410.10700 • Published Oct 14, 2024 • 3

X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usability

Paper • 2502.09990 • Published Feb 14 • 1

updated a dataset 5 months ago

Ursulalala/IS_Bench_dataset

Viewer • Updated Jul 14 • 6.06k • 120

published a dataset 5 months ago

Ursulalala/IS_Bench_dataset

Viewer • Updated Jul 14 • 6.06k • 120