Thao Nguyen PRO

thaoshibe

·

https://thaoshibe.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 16 days ago

thaoshibe/camroll-yfcc20

published a dataset 16 days ago

thaoshibe/camroll-yfcc20

commentedon a paper 16 days ago

From AGI to ASI

View all activity

Organizations

None yet

upvoted a paper 20 days ago

VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection

Paper • 2505.20289 • Published May 26, 2025 • 11

upvoted 2 papers 26 days ago

MAOAM: Unified Object and Material Selection with Vision-Language Models

Paper • 2606.04880 • Published 29 days ago • 10

Personal AI Agent for Camera Roll VQA

Paper • 2606.05275 • Published 28 days ago • 20

upvoted a paper about 1 month ago

From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing

Paper • 2605.15181 • Published May 14 • 12

upvoted a paper 3 months ago

Exploration and Exploitation Errors Are Measurable for Language Model Agents

Paper • 2604.13151 • Published Apr 14 • 25

upvoted a paper 5 months ago

Agentic Very Long Video Understanding

Paper • 2601.18157 • Published Jan 26 • 22

upvoted 2 collections 7 months ago

cool-papers

111 items • Updated May 22 • 11

Cool Architecture

6 items • Updated Dec 12, 2025 • 1

upvoted 3 papers 7 months ago

See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models

Paper • 2512.02231 • Published Dec 1, 2025 • 9

Visual Instruction Inversion: Image Editing via Visual Prompting

Paper • 2307.14331 • Published Jul 26, 2023 • 1

Relational Visual Similarity

Paper • 2512.07833 • Published Dec 8, 2025 • 25

upvoted 3 papers about 1 year ago

Yo'LLaVA: Your Personalized Language and Vision Assistant

Paper • 2406.09400 • Published Jun 13, 2024 • 2

YoChameleon: Personalized Vision and Language Generation

Paper • 2504.20998 • Published Apr 29, 2025 • 12

X-Fusion: Introducing New Modality to Frozen Large Language Models

Paper • 2504.20996 • Published Apr 29, 2025 • 13