1 89 33

Kyu Song

kyunocap

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

upvoted a paper 1 day ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

liked a model 4 days ago

lovis93/next-scene-qwen-image-lora-2509

View all activity

Organizations

None yet

upvoted a paper about 4 hours ago

LTX-2: Efficient Joint Audio-Visual Foundation Model

Paper • 2601.03233 • Published 10 days ago • 121

upvoted a paper 1 day ago

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Paper • 2601.09575 • Published 2 days ago • 23

liked a model 4 days ago

lovis93/next-scene-qwen-image-lora-2509

Image-to-Image • Updated Oct 21, 2025 • 48.8k • • 567

liked a Space 4 days ago

LTX-2 Video Fast

🎥

117

Fast high quality video with audio generation

upvoted 2 papers 4 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 8 days ago • 191

Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Paper • 2601.05432 • Published 8 days ago • 157

upvoted a paper 9 days ago

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published 10 days ago • 95

liked a model about 1 month ago

facebook/pe-av-large

2B • Updated 24 days ago • 823 • 46

upvoted 12 papers about 1 month ago

EgoX: Egocentric Video Generation from a Single Exocentric Video

Paper • 2512.08269 • Published Dec 9, 2025 • 117

Memory in the Age of AI Agents

Paper • 2512.13564 • Published Dec 15, 2025 • 143

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Paper • 2512.10881 • Published Dec 11, 2025 • 28

Agentic Context Engineering: Evolving Contexts for Self-Improving Language Models

Paper • 2510.04618 • Published Oct 6, 2025 • 128

Composing Concepts from Images and Videos via Concept-prompt Binding

Paper • 2512.09824 • Published Dec 10, 2025 • 27

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 46

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Paper • 2512.07831 • Published Dec 8, 2025 • 16

Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality

Paper • 2512.07951 • Published Dec 8, 2025 • 48

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published Dec 9, 2025 • 130

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

Paper • 2512.00473 • Published Nov 29, 2025 • 25

Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs

Paper • 2512.07525 • Published Dec 8, 2025 • 58

Unified Video Editing with Temporal Reasoner

Paper • 2512.07469 • Published Dec 8, 2025 • 45

Kyu Song

AI & ML interests

Recent Activity

Organizations

kyunocap's activity

LTX-2 Video Fast