Picture for Jianzhong Ju

Jianzhong Ju

Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously

Add code
Mar 12, 2026
Viaarxiv icon

MSJoE: Jointly Evolving MLLM and Sampler for Efficient Long-Form Video Understanding

Add code
Feb 26, 2026
Viaarxiv icon

ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding

Add code
Feb 26, 2026
Viaarxiv icon

Visual Para-Thinker: Divide-and-Conquer Reasoning for Visual Comprehension

Add code
Feb 10, 2026
Viaarxiv icon

GeoFocus: Blending Efficient Global-to-Local Perception for Multimodal Geometry Problem-Solving

Add code
Feb 09, 2026
Viaarxiv icon

Video-OPD: Efficient Post-Training of Multimodal Large Language Models for Temporal Video Grounding via On-Policy Distillation

Add code
Feb 03, 2026
Viaarxiv icon

Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

Federated Balanced Learning

Add code
Jan 20, 2026
Viaarxiv icon

Federated Joint Learning for Domain and Class Generalization

Add code
Jan 18, 2026
Viaarxiv icon

Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding

Add code
Jan 16, 2026
Viaarxiv icon