Picture for Zhifang Sui

Zhifang Sui

HistLens: Mapping Idea Change across Concepts and Corpora

Add code
Apr 13, 2026
Viaarxiv icon

Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents

Add code
Apr 07, 2026
Viaarxiv icon

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Add code
Mar 05, 2026
Viaarxiv icon

Towards Better RL Training Data Utilization via Second-Order Rollout

Add code
Feb 26, 2026
Viaarxiv icon

CoLT: Reasoning with Chain of Latent Tool Calls

Add code
Feb 04, 2026
Viaarxiv icon

Decoding in Geometry: Alleviating Embedding-Space Crowding for Complex Reasoning

Add code
Jan 30, 2026
Viaarxiv icon

TeachBench: A Syllabus-Grounded Framework for Evaluating Teaching Ability in Large Language Models

Add code
Jan 29, 2026
Viaarxiv icon

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

Add code
Dec 19, 2025
Viaarxiv icon

Towards Stable and Effective Reinforcement Learning for Mixture-of-Experts

Add code
Oct 27, 2025
Viaarxiv icon

LLM-REVal: Can We Trust LLM Reviewers Yet?

Add code
Oct 14, 2025
Figure 1 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 2 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 3 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Figure 4 for LLM-REVal: Can We Trust LLM Reviewers Yet?
Viaarxiv icon