Picture for Yanye Lu

Yanye Lu

AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs

Add code
Nov 18, 2025
Figure 1 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 2 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 3 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Figure 4 for AdaTok: Adaptive Token Compression with Object-Aware Representations for Efficient Multimodal LLMs
Viaarxiv icon

Omni-View: Unlocking How Generation Facilitates Understanding in Unified 3D Model based on Multiview images

Add code
Nov 10, 2025
Viaarxiv icon

Novel Extraction of Discriminative Fine-Grained Feature to Improve Retinal Vessel Segmentation

Add code
May 06, 2025
Viaarxiv icon

SuperCL: Superpixel Guided Contrastive Learning for Medical Image Segmentation Pre-training

Add code
Apr 20, 2025
Viaarxiv icon

Exploiting Inherent Class Label: Towards Robust Scribble Supervised Semantic Segmentation

Add code
Mar 18, 2025
Viaarxiv icon

Universal Image Restoration Pre-training via Degradation Classification

Add code
Jan 26, 2025
Viaarxiv icon

V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer

Add code
Jan 09, 2025
Figure 1 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 2 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 3 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Figure 4 for V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Viaarxiv icon

Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation

Add code
Dec 19, 2024
Figure 1 for Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation
Figure 2 for Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation
Figure 3 for Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation
Figure 4 for Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation
Viaarxiv icon

Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation

Add code
Jun 19, 2024
Figure 1 for Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation
Figure 2 for Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation
Figure 3 for Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation
Figure 4 for Low-Rank Mixture-of-Experts for Continual Medical Image Segmentation
Viaarxiv icon

Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%

Add code
Jun 17, 2024
Figure 1 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 2 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 3 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Figure 4 for Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%
Viaarxiv icon