Picture for Jiaze Li

Jiaze Li

MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

Joint Orientation and Weight Optimization for Robust Watertight Surface Reconstruction via Dirichlet-Regularized Winding Fields

Add code
Feb 14, 2026
Viaarxiv icon

Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension

Add code
Feb 10, 2026
Viaarxiv icon

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Add code
Feb 03, 2026
Viaarxiv icon

SOMBRERO: Measuring and Steering Boundary Placement in End-to-End Hierarchical Sequence Models

Add code
Jan 30, 2026
Viaarxiv icon

Federated Balanced Learning

Add code
Jan 20, 2026
Viaarxiv icon

Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model

Add code
Jan 20, 2026
Viaarxiv icon

Federated Joint Learning for Domain and Class Generalization

Add code
Jan 18, 2026
Viaarxiv icon

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding

Add code
Jan 16, 2026
Viaarxiv icon

Xiaomi MiMo-VL-Miloco Technical Report

Add code
Dec 22, 2025
Figure 1 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 2 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 3 for Xiaomi MiMo-VL-Miloco Technical Report
Figure 4 for Xiaomi MiMo-VL-Miloco Technical Report
Viaarxiv icon