Picture for Yun Fu

Yun Fu

Accessing Vision Foundation Models at ImageNet-level Costs

Add code
Jul 15, 2024
Viaarxiv icon

SoupLM: Model Integration in Large Language and Multi-Modal Models

Add code
Jul 11, 2024
Viaarxiv icon

Through the Theory of Mind's Eye: Reading Minds with Multimodal Video Large Language Models

Add code
Jun 19, 2024
Viaarxiv icon

Deciphering Movement: Unified Trajectory Generation Model for Multi-Agent

Add code
May 27, 2024
Viaarxiv icon

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

Add code
Apr 16, 2024
Figure 1 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 2 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 3 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Figure 4 for Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering
Viaarxiv icon

Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

Add code
Apr 06, 2024
Figure 1 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 2 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 3 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Figure 4 for Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement
Viaarxiv icon

OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising

Add code
Apr 02, 2024
Figure 1 for OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Figure 2 for OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Figure 3 for OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Figure 4 for OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Viaarxiv icon

Adapting to Length Shift: FlexiLength Network for Trajectory Prediction

Add code
Mar 31, 2024
Figure 1 for Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Figure 2 for Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Figure 3 for Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Figure 4 for Adapting to Length Shift: FlexiLength Network for Trajectory Prediction
Viaarxiv icon

Rewrite the Stars

Add code
Mar 29, 2024
Figure 1 for Rewrite the Stars
Figure 2 for Rewrite the Stars
Figure 3 for Rewrite the Stars
Figure 4 for Rewrite the Stars
Viaarxiv icon

Efficient Modulation for Vision Networks

Add code
Mar 29, 2024
Figure 1 for Efficient Modulation for Vision Networks
Figure 2 for Efficient Modulation for Vision Networks
Figure 3 for Efficient Modulation for Vision Networks
Figure 4 for Efficient Modulation for Vision Networks
Viaarxiv icon