Picture for Zihao Yue

Zihao Yue

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Add code
May 12, 2025
Viaarxiv icon

TimeZero: Temporal Video Grounding with Reasoning-Guided LVLM

Add code
Mar 17, 2025
Viaarxiv icon

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

Add code
Mar 09, 2025
Viaarxiv icon

Movie101v2: Improved Movie Narration Benchmark

Add code
Apr 20, 2024
Figure 1 for Movie101v2: Improved Movie Narration Benchmark
Figure 2 for Movie101v2: Improved Movie Narration Benchmark
Figure 3 for Movie101v2: Improved Movie Narration Benchmark
Figure 4 for Movie101v2: Improved Movie Narration Benchmark
Viaarxiv icon

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

Add code
Feb 22, 2024
Viaarxiv icon

Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation

Add code
Jun 27, 2023
Figure 1 for Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
Figure 2 for Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
Figure 3 for Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
Figure 4 for Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation
Viaarxiv icon

Movie101: A New Movie Understanding Benchmark

Add code
May 20, 2023
Figure 1 for Movie101: A New Movie Understanding Benchmark
Figure 2 for Movie101: A New Movie Understanding Benchmark
Figure 3 for Movie101: A New Movie Understanding Benchmark
Figure 4 for Movie101: A New Movie Understanding Benchmark
Viaarxiv icon