Shiji Song

FLatten Transformer: Vision Transformer using Focused Linear Attention

Aug 01, 2023

Dynamic Perceiver for Efficient Visual Recognition

Jun 20, 2023

Offline Prioritized Experience Replay

Jun 08, 2023

Boosting Offline Reinforcement Learning with Action Preference Query

Jun 06, 2023

Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention

Apr 09, 2023

Zero-shot Generative Model Adaptation via Image-specific Prompt Learning

Apr 06, 2023

Adaptive Rotated Convolution for Rotated Object Detection

Mar 14, 2023

Joint Representation Learning for Text and 3D Point Cloud

Jan 18, 2023

EfficientTrain: Exploring Generalized Curriculum Learning for Training Visual Backbones

Nov 17, 2022

Cross-Modal Adapter for Text-Video Retrieval

Nov 17, 2022