Meng Cao

CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal

Dec 22, 2025

GLaD: Geometric Latent Distillation for Vision-Language-Action Models

Dec 10, 2025

SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery

Dec 08, 2025

Video Spatial Reasoning with Object-Centric 3D Rollout

Nov 17, 2025

Beyond Observations: Reconstruction Error-Guided Irregularly Sampled Time Series Representation Learning

Nov 15, 2025

COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization

Oct 08, 2025

Checklists Are Better Than Reward Models For Aligning Language Models

Jul 24, 2025

C2-Evo: Co-Evolving Multimodal Data and Model for Self-Improving Reasoning

Jul 22, 2025

PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly

Jun 10, 2025

Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation

May 29, 2025