Picture for Zhi Zheng

Zhi Zheng

ADRD-Bench: A Preliminary LLM Benchmark for Alzheimer's Disease and Related Dementias

Add code
Feb 12, 2026
Viaarxiv icon

Identifying Evidence-Based Nudges in Biomedical Literature with Large Language Models

Add code
Feb 10, 2026
Viaarxiv icon

Data Science and Technology Towards AGI Part I: Tiered Data Management

Add code
Feb 09, 2026
Viaarxiv icon

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Add code
Jan 29, 2026
Viaarxiv icon

Token-level Collaborative Alignment for LLM-based Generative Recommendation

Add code
Jan 26, 2026
Viaarxiv icon

Self-Manager: Parallel Agent Loop for Long-form Deep Research

Add code
Jan 25, 2026
Viaarxiv icon

VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit

Add code
Jan 09, 2026
Viaarxiv icon

DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation

Add code
Jan 09, 2026
Viaarxiv icon

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Add code
Nov 09, 2025
Viaarxiv icon

A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning

Add code
Sep 26, 2025
Figure 1 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 2 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 3 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 4 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Viaarxiv icon