Yan Zheng

Dual-Enhancement Product Bundling: Bridging Interactive Graph and Large Language Model

Apr 15, 2026
Via arXiv

VoxelCodeBench: Benchmarking 3D World Modeling Through Code Generation

Apr 02, 2026
Via arXiv

The Rank and Gradient Lost in Non-stationarity: Sample Weight Decay for Mitigating Plasticity Loss in Reinforcement Learning

Apr 02, 2026
Via arXiv

Rethink Efficiency Side of Neural Combinatorial Solver: An Offline and Self-Play Paradigm

Feb 24, 2026
Via arXiv

Understanding LLM Evaluator Behavior: A Structured Multi-Evaluator Framework for Merchant Risk Assessment

Feb 04, 2026
Via arXiv

Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA

Nov 19, 2025
Via arXiv

Key Decision-Makers in Multi-Agent Debates: Who Holds the Power?

Nov 14, 2025
Via arXiv

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Sep 18, 2025
Via arXiv

Empowering Time Series Forecasting with LLM-Agents

Aug 06, 2025
Via arXiv

Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model

Jul 09, 2025
Via arXiv