Picture for Xiaoyang Guo

Xiaoyang Guo

Adaptive-VoCo: Complexity-Aware Visual Token Compression for Vision-Language Models

Add code
Dec 20, 2025
Viaarxiv icon

UniPart: Part-Level 3D Generation with Unified 3D Geom-Seg Latents

Add code
Dec 10, 2025
Viaarxiv icon

HybridToken-VLM: Hybrid Token Compression for Vision-Language Models

Add code
Dec 09, 2025
Viaarxiv icon

MM-CoT:A Benchmark for Probing Visual Chain-of-Thought Reasoning in Multimodal Models

Add code
Dec 09, 2025
Viaarxiv icon

OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction

Add code
Sep 04, 2025
Viaarxiv icon

SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization

Add code
Aug 25, 2025
Figure 1 for SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
Figure 2 for SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
Figure 3 for SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
Figure 4 for SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization
Viaarxiv icon

MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

Add code
Jul 10, 2025
Figure 1 for MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
Figure 2 for MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
Figure 3 for MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
Figure 4 for MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization
Viaarxiv icon

Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging

Add code
Mar 31, 2025
Figure 1 for Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Figure 2 for Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Figure 3 for Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Figure 4 for Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
Viaarxiv icon

AI Governance InternationaL Evaluation Index (AGILE Index)

Add code
Feb 26, 2025
Viaarxiv icon

RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning

Add code
Feb 18, 2025
Figure 1 for RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Figure 2 for RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Figure 3 for RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Figure 4 for RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning
Viaarxiv icon