Yusen Zhang

Training Step-Level Reasoning Verifiers with Formal Verification Tools

May 21, 2025

NeuroGen: Neural Network Parameter Generation via Large Language Models

May 18, 2025

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Apr 29, 2025

GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization

Apr 04, 2025

When Reasoning Meets Compression: Benchmarking Compressed Large Reasoning Models on Complex Reasoning Tasks

Apr 02, 2025

GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Dec 12, 2024

Coverage-based Fairness in Multi-document Summarization

Dec 11, 2024

VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information

Dec 01, 2024

Verbosity $\neq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models

Nov 12, 2024

AAAR-1.0: Assessing AI's Potential to Assist Research

Oct 29, 2024