Liang-Yan Gui

InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Sep 11, 2025

Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration

Sep 11, 2025

Refer to Anything with Vision-Language Prompts

Jun 05, 2025

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

May 29, 2025

InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object Interactions

Feb 27, 2025

Emerging Pixel Grounding in Large Multimodal Models Without Grounding Supervision

Oct 10, 2024

Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding

Sep 05, 2024

Floating No More: Object-Ground Reconstruction from a Single Image

Jul 26, 2024

Situational Awareness Matters in 3D Vision Language Reasoning

Jun 11, 2024

SOHES: Self-supervised Open-world Hierarchical Entity Segmentation

Apr 18, 2024