Jie Zhou

Evaluating Accounting Reasoning Capabilities of Large Language Models

Jan 10, 2026
Via arXiv

PsychEval: A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor

Jan 08, 2026
Via arXiv

Figure It Out: Improve the Frontier of Reasoning with Executable Visual States

Jan 06, 2026
Via arXiv

UltraEval-Audio: A Unified Framework for Comprehensive Evaluation of Audio Foundation Models

Jan 04, 2026
Via arXiv

Figure It Out: Improving the Frontier of Reasoning with Active Visual Thinking

Dec 30, 2025
Via arXiv

Improving Multi-step RAG with Hypergraph-based Memory for Long-Context Complex Relational Modeling

Dec 30, 2025
Via arXiv

NeXT-IMDL: Build Benchmark for NeXT-Generation Image Manipulation Detection & Localization

Dec 29, 2025
Via arXiv

WeDLM: Reconciling Diffusion Language Models with Standard Causal Attention for Fast Inference

Dec 28, 2025
Via arXiv

Exploring the Vertical-Domain Reasoning Capabilities of Large Language Models

Dec 27, 2025
Via arXiv

Mindscape-Aware Retrieval Augmented Generation for Improved Long Context Understanding

Dec 19, 2025
Via arXiv