Picture for Weiyuan Chen

Weiyuan Chen

PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference

Add code
Mar 03, 2026
Viaarxiv icon

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

Add code
Mar 03, 2026
Viaarxiv icon

ROMA: Recursive Open Meta-Agent Framework for Long-Horizon Multi-Agent Systems

Add code
Feb 02, 2026
Viaarxiv icon

AbGen: Evaluating Large Language Models in Ablation Study Design and Evaluation for Scientific Research

Add code
Jul 17, 2025
Viaarxiv icon

MMVU: Measuring Expert-Level Multi-Discipline Video Understanding

Add code
Jan 21, 2025
Figure 1 for MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Figure 2 for MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Figure 3 for MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Figure 4 for MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Viaarxiv icon

FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents

Add code
Nov 08, 2024
Figure 1 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 2 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 3 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Figure 4 for FinDVer: Explainable Claim Verification over Long and Hybrid-Content Financial Documents
Viaarxiv icon