Kai Han

GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer

Jun 04, 2024

SA-GS: Semantic-Aware Gaussian Splatting for Large Scene Reconstruction with Geometry Constrain

May 28, 2024

SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation

May 14, 2024

EMS-SD: Efficient Multi-sample Speculative Decoding for Accelerating Large Language Models

May 13, 2024

Memory-Space Visual Prompting for Efficient Vision-Language Fine-Tuning

May 09, 2024

Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Apr 29, 2024

GhostNetV3: Exploring the Training Strategies for Compact Models

Apr 17, 2024

SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning

Mar 20, 2024

DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models

Mar 05, 2024

A robust audio deepfake detection system via multi-view feature

Mar 04, 2024