Picture for Shiji Song

Shiji Song

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Add code
Aug 11, 2024
Figure 1 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 2 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 3 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 4 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Viaarxiv icon

Rethinking the Architecture Design for Efficient Generic Event Boundary Detection

Add code
Jul 17, 2024
Viaarxiv icon

Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing

Add code
Jul 11, 2024
Figure 1 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 2 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 3 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Figure 4 for Model Surgery: Modulating LLM's Behavior Via Simple Parameter Editing
Viaarxiv icon

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

Add code
Jul 03, 2024
Figure 1 for DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Figure 2 for DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Figure 3 for DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Figure 4 for DyFADet: Dynamic Feature Aggregation for Temporal Action Detection
Viaarxiv icon

Structure-aware World Model for Probe Guidance via Large-scale Self-supervised Pre-train

Add code
Jun 28, 2024
Viaarxiv icon

Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model

Add code
Jun 19, 2024
Figure 1 for Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model
Figure 2 for Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model
Figure 3 for Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model
Figure 4 for Cardiac Copilot: Automatic Probe Guidance for Echocardiography with World Model
Viaarxiv icon

Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis

Add code
Jun 08, 2024
Figure 1 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Figure 2 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Figure 3 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Figure 4 for Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis
Viaarxiv icon

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

Add code
Jun 06, 2024
Figure 1 for Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Figure 2 for Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Figure 3 for Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Figure 4 for Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment
Viaarxiv icon

Demystify Mamba in Vision: A Linear Attention Perspective

Add code
May 26, 2024
Figure 1 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 2 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 3 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 4 for Demystify Mamba in Vision: A Linear Attention Perspective
Viaarxiv icon

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Add code
May 24, 2024
Viaarxiv icon