You He

Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation

Nov 12, 2025

A Survey of Multi-sensor Fusion Perception for Embodied AI: Background, Methods, Challenges and Prospects

Jun 24, 2025

EffOWT: Transfer Visual Language Models to Open-World Tracking Efficiently and Effectively

Apr 09, 2025

ReNeg: Learning Negative Embedding with Reward Guidance

Dec 27, 2024

GaLore+: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection

Dec 15, 2024

MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models

Dec 02, 2024

LLMs Can Evolve Continually on Modality for X-Modal Reasoning

Oct 26, 2024

GLRT-Based Metric Learning for Remote Sensing Object Retrieval

Oct 08, 2024

Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters

Mar 18, 2024

Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models

Oct 04, 2023