Picture for Zhi Zheng

Zhi Zheng

Data Science and Technology Towards AGI Part I: Tiered Data Management

Add code
Feb 09, 2026
Viaarxiv icon

Beyond Imitation: Reinforcement Learning for Active Latent Planning

Add code
Jan 29, 2026
Viaarxiv icon

Token-level Collaborative Alignment for LLM-based Generative Recommendation

Add code
Jan 26, 2026
Viaarxiv icon

Self-Manager: Parallel Agent Loop for Long-form Deep Research

Add code
Jan 25, 2026
Viaarxiv icon

DynaDebate: Breaking Homogeneity in Multi-Agent Debate with Dynamic Path Generation

Add code
Jan 09, 2026
Viaarxiv icon

VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit

Add code
Jan 09, 2026
Viaarxiv icon

SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization

Add code
Nov 09, 2025
Viaarxiv icon

A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning

Add code
Sep 26, 2025
Figure 1 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 2 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 3 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Figure 4 for A2R: An Asymmetric Two-Stage Reasoning Framework for Parallel Reasoning
Viaarxiv icon

PerchMobi^3: A Multi-Modal Robot with Power-Reuse Quad-Fan Mechanism for Air-Ground-Wall Locomotion

Add code
Sep 16, 2025
Viaarxiv icon

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Add code
Sep 09, 2025
Viaarxiv icon