Yi Yang

The Hong Kong University of Science and Technology, Hong Kong SAR, China

TASU2: Controllable CTC Simulation for Alignment and Low-Resource Adaptation of Speech LLMs

Apr 09, 2026
Via arXiv

PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing

Apr 09, 2026
Via arXiv

RefineAnything: Multimodal Region-Specific Refinement for Perfect Local Details

Apr 08, 2026
Via arXiv

Flux Attention: Context-Aware Hybrid Attention for Efficient LLMs Inference

Apr 08, 2026
Via arXiv

BiScale-GTR: Fragment-Aware Graph Transformers for Multi-Scale Molecular Representation Learning

Apr 07, 2026
Via arXiv

Rendering Multi-Human and Multi-Object with 3D Gaussian Splatting

Apr 03, 2026
Via arXiv

TRACE: High-Fidelity 3D Scene Editing via Tangible Reconstruction and Geometry-Aligned Contextual Video Masking

Apr 01, 2026
Via arXiv

LogiStory: A Logic-Aware Framework for Multi-Image Story Visualization

Mar 30, 2026
Via arXiv

FilterGS: Traversal-Free Parallel Filtering and Adaptive Shrinking for Large-Scale LoD 3D Gaussian Splatting

Mar 25, 2026
Via arXiv

FoleyDirector: Fine-Grained Temporal Steering for Video-to-Audio Generation via Structured Scripts

Mar 20, 2026
Via arXiv