Picture for Weijiang Lv

Weijiang Lv

Act As a Real Researcher: A Suite of Benchmarks Evaluating Frontier LLMs and Agentic Harnesses in Research Lifecycle

Add code
Jun 05, 2026
Viaarxiv icon

GeoFaith: A Spatio-Temporal Dual View of Faithful Chain-of-Thought

Add code
May 26, 2026
Viaarxiv icon

Dual Prototype-Conditioned Diffusion Model for Scalable Multi-Class Unsupervised Anomaly Detection in Large Category Spaces

Add code
May 23, 2026
Viaarxiv icon

SPD-Faith Bench: Diagnosing and Improving Faithfulness in Chain-of-Thought for Multimodal Large Language Models

Add code
Feb 08, 2026
Viaarxiv icon