Picture for Yu Cheng

Yu Cheng

SEE: Continual Fine-tuning with Sequential Ensemble of Experts

Add code
Apr 09, 2025
Viaarxiv icon

TransMamba: Flexibly Switching between Transformer and Mamba

Add code
Mar 31, 2025
Viaarxiv icon

Strategies for decentralised UAV-based collisions monitoring in rugby

Add code
Mar 27, 2025
Figure 1 for Strategies for decentralised UAV-based collisions monitoring in rugby
Figure 2 for Strategies for decentralised UAV-based collisions monitoring in rugby
Figure 3 for Strategies for decentralised UAV-based collisions monitoring in rugby
Figure 4 for Strategies for decentralised UAV-based collisions monitoring in rugby
Viaarxiv icon

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Add code
Mar 27, 2025
Viaarxiv icon

LangBridge: Interpreting Image as a Combination of Language Embeddings

Add code
Mar 26, 2025
Figure 1 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 2 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 3 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Figure 4 for LangBridge: Interpreting Image as a Combination of Language Embeddings
Viaarxiv icon

ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning

Add code
Mar 25, 2025
Viaarxiv icon

LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models

Add code
Mar 18, 2025
Figure 1 for LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Figure 2 for LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Figure 3 for LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Figure 4 for LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models
Viaarxiv icon

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Add code
Mar 17, 2025
Figure 1 for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Figure 2 for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Figure 3 for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Figure 4 for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration
Viaarxiv icon

BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries

Add code
Mar 16, 2025
Figure 1 for BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Figure 2 for BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Figure 3 for BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Figure 4 for BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Viaarxiv icon

Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts

Add code
Mar 07, 2025
Figure 1 for Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Figure 2 for Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Figure 3 for Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Figure 4 for Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts
Viaarxiv icon