Picture for Weihao Yu

Weihao Yu

LightBagel: A Light-weighted, Double Fusion Framework for Unified Multimodal Understanding and Generation

Add code
Oct 27, 2025
Viaarxiv icon

Artificial Hippocampus Networks for Efficient Long-Context Modeling

Add code
Oct 08, 2025
Viaarxiv icon

AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography

Add code
May 21, 2025
Viaarxiv icon

MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models

Add code
May 21, 2025
Viaarxiv icon

Emerging Properties in Unified Multimodal Pretraining

Add code
May 20, 2025
Figure 1 for Emerging Properties in Unified Multimodal Pretraining
Figure 2 for Emerging Properties in Unified Multimodal Pretraining
Figure 3 for Emerging Properties in Unified Multimodal Pretraining
Figure 4 for Emerging Properties in Unified Multimodal Pretraining
Viaarxiv icon

Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning

Add code
May 17, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

Add code
Mar 27, 2025
Viaarxiv icon

Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion

Add code
Jan 29, 2025
Figure 1 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 2 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 3 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 4 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Viaarxiv icon