Meng Cao

MR. Judge: Multimodal Reasoner as a Judge

May 19, 2025

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

May 08, 2025

BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

May 01, 2025

A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Apr 21, 2025

IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs

Apr 21, 2025

The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction

Mar 29, 2025

SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding

Mar 27, 2025

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models

Mar 24, 2025

TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba

Feb 21, 2025

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Feb 20, 2025