Picture for Enhua Wu

Enhua Wu

VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse

Add code
Dec 16, 2025
Viaarxiv icon

Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs

Add code
Oct 14, 2024
Figure 1 for Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
Figure 2 for Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
Figure 3 for Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
Figure 4 for Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs
Viaarxiv icon

Denoising with a Joint-Embedding Predictive Architecture

Add code
Oct 02, 2024
Figure 1 for Denoising with a Joint-Embedding Predictive Architecture
Figure 2 for Denoising with a Joint-Embedding Predictive Architecture
Figure 3 for Denoising with a Joint-Embedding Predictive Architecture
Figure 4 for Denoising with a Joint-Embedding Predictive Architecture
Viaarxiv icon

Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input

Add code
Aug 28, 2024
Figure 1 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 2 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 3 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Figure 4 for Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
Viaarxiv icon

Deformable 3D Shape Diffusion Model

Add code
Jul 31, 2024
Figure 1 for Deformable 3D Shape Diffusion Model
Figure 2 for Deformable 3D Shape Diffusion Model
Figure 3 for Deformable 3D Shape Diffusion Model
Figure 4 for Deformable 3D Shape Diffusion Model
Viaarxiv icon

Fine-gained Zero-shot Video Sampling

Add code
Jul 31, 2024
Figure 1 for Fine-gained Zero-shot Video Sampling
Figure 2 for Fine-gained Zero-shot Video Sampling
Figure 3 for Fine-gained Zero-shot Video Sampling
Figure 4 for Fine-gained Zero-shot Video Sampling
Viaarxiv icon

Space-time Reinforcement Network for Video Object Segmentation

Add code
May 07, 2024
Viaarxiv icon

ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Add code
Jun 26, 2023
Figure 1 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 2 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 3 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 4 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Viaarxiv icon

Robust and Efficient Memory Network for Video Object Segmentation

Add code
Apr 24, 2023
Figure 1 for Robust and Efficient Memory Network for Video Object Segmentation
Figure 2 for Robust and Efficient Memory Network for Video Object Segmentation
Figure 3 for Robust and Efficient Memory Network for Video Object Segmentation
Figure 4 for Robust and Efficient Memory Network for Video Object Segmentation
Viaarxiv icon

Bag of Tricks with Quantized Convolutional Neural Networks for image classification

Add code
Mar 13, 2023
Figure 1 for Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Figure 2 for Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Figure 3 for Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Figure 4 for Bag of Tricks with Quantized Convolutional Neural Networks for image classification
Viaarxiv icon