Picture for Di Fu

Di Fu

Advancing User-Voice Interaction: Exploring Emotion-Aware Voice Assistants Through a Role-Swapping Approach

Add code
Feb 21, 2025
Viaarxiv icon

The 1st InterAI Workshop: Interactive AI for Human-centered Robotics

Add code
Sep 17, 2024
Viaarxiv icon

Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer

Add code
Aug 30, 2024
Figure 1 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 2 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 3 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Figure 4 for Vote&Mix: Plug-and-Play Token Reduction for Efficient Vision Transformer
Viaarxiv icon

MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models

Add code
Aug 30, 2024
Figure 1 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 2 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 3 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Figure 4 for MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
Viaarxiv icon

Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction

Add code
Aug 29, 2024
Figure 1 for Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction
Figure 2 for Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction
Figure 3 for Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction
Figure 4 for Shot Segmentation Based on Von Neumann Entropy for Key Frame Extraction
Viaarxiv icon

Decoupled Prompt-Adapter Tuning for Continual Activity Recognition

Add code
Jul 20, 2024
Figure 1 for Decoupled Prompt-Adapter Tuning for Continual Activity Recognition
Figure 2 for Decoupled Prompt-Adapter Tuning for Continual Activity Recognition
Figure 3 for Decoupled Prompt-Adapter Tuning for Continual Activity Recognition
Figure 4 for Decoupled Prompt-Adapter Tuning for Continual Activity Recognition
Viaarxiv icon

Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models

Add code
May 07, 2024
Figure 1 for Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Figure 2 for Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Figure 3 for Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Figure 4 for Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models
Viaarxiv icon

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Add code
Apr 02, 2024
Figure 1 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 2 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 3 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Figure 4 for Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
Viaarxiv icon

Human Impression of Humanoid Robots Mirroring Social Cues

Add code
Jan 22, 2024
Figure 1 for Human Impression of Humanoid Robots Mirroring Social Cues
Figure 2 for Human Impression of Humanoid Robots Mirroring Social Cues
Figure 3 for Human Impression of Humanoid Robots Mirroring Social Cues
Viaarxiv icon

The Emotional Dilemma: Influence of a Human-like Robot on Trust and Cooperation

Add code
Jul 06, 2023
Viaarxiv icon