Chi-Wing Fu

EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning

Jan 27, 2026

IdentityStory: Taming Your Identity-Preserving Generator for Human-Centric Story Generation

Dec 29, 2025

OpenView: Empowering MLLMs with Out-of-view VQA

Dec 21, 2025

DisCo-Layout: Disentangling and Coordinating Semantic and Physical Refinement in a Multi-Agent Framework for 3D Indoor Layout Synthesis

Oct 02, 2025

Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making

May 27, 2025

ARPO: End-to-End Policy Optimization for GUI Agents with Experience Replay

May 22, 2025

OPA-Pack: Object-Property-Aware Robotic Bin Packing

May 19, 2025

Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation

May 19, 2025

Hand-Shadow Poser

May 11, 2025

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning

May 07, 2025