Picture for Xu Wang

Xu Wang

MiMo-V2-Flash Technical Report

Add code
Jan 08, 2026
Viaarxiv icon

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Add code
Jan 05, 2026
Viaarxiv icon

VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

Add code
Jan 05, 2026
Viaarxiv icon

MiMo-Audio: Audio Language Models are Few-Shot Learners

Add code
Dec 29, 2025
Viaarxiv icon

Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis

Add code
Dec 18, 2025
Figure 1 for Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis
Figure 2 for Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis
Figure 3 for Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis
Figure 4 for Scaling Spatial Reasoning in MLLMs through Programmatic Data Synthesis
Viaarxiv icon

EagleVision: A Dual-Stage Framework with BEV-grounding-based Chain-of-Thought for Spatial Intelligence

Add code
Dec 17, 2025
Viaarxiv icon

Q-Doc: Benchmarking Document Image Quality Assessment Capabilities in Multi-modal Large Language Models

Add code
Nov 14, 2025
Viaarxiv icon

DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation

Add code
Nov 13, 2025
Figure 1 for DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation
Figure 2 for DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation
Figure 3 for DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation
Figure 4 for DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation
Viaarxiv icon

Semantic-Consistent Bidirectional Contrastive Hashing for Noisy Multi-Label Cross-Modal Retrieval

Add code
Nov 11, 2025
Viaarxiv icon

MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control

Add code
Oct 01, 2025
Figure 1 for MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control
Figure 2 for MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control
Figure 3 for MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control
Figure 4 for MG2FlowNet: Accelerating High-Reward Sample Generation via Enhanced MCTS and Greediness Control
Viaarxiv icon