Picture for Weihao Yu

Weihao Yu

X-GRM: Large Gaussian Reconstruction Model for Sparse-view X-rays to Computed Tomography

Add code
May 21, 2025
Viaarxiv icon

MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models

Add code
May 21, 2025
Viaarxiv icon

Emerging Properties in Unified Multimodal Pretraining

Add code
May 20, 2025
Viaarxiv icon

Top-Down Compression: Revisit Efficient Vision Token Projection for Visual Instruction Tuning

Add code
May 17, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Viaarxiv icon

X$^{2}$-Gaussian: 4D Radiative Gaussian Splatting for Continuous-time Tomographic Reconstruction

Add code
Mar 27, 2025
Viaarxiv icon

Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion

Add code
Jan 29, 2025
Figure 1 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 2 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 3 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Figure 4 for Polyp-Gen: Realistic and Diverse Polyp Image Generation for Endoscopic Dataset Expansion
Viaarxiv icon

Attention Prompting on Image for Large Vision-Language Models

Add code
Sep 25, 2024
Viaarxiv icon

LinFusion: 1 GPU, 1 Minute, 16K Image

Add code
Sep 03, 2024
Viaarxiv icon

MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

Add code
Aug 01, 2024
Figure 1 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 2 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 3 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Figure 4 for MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities
Viaarxiv icon