Chun Yuan

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era

Nov 08, 2025, via arXiv

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

Oct 31, 2025, via arXiv

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Oct 14, 2025, via arXiv

Unified and Semantically Grounded Domain Adaptation for Medical Image Segmentation

Aug 12, 2025, via arXiv

Text-guided Visual Prompt DINO for Generic Segmentation

Aug 08, 2025, via arXiv

UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis

Jul 02, 2025, via arXiv

SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

May 30, 2025, via arXiv

A Simple Linear Patch Revives Layer-Pruned Large Language Models

May 30, 2025, via arXiv

CLIP-AE: CLIP-assisted Cross-view Audio-Visual Enhancement for Unsupervised Temporal Action Localization

May 29, 2025, via arXiv

Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging

May 26, 2025, via arXiv