Ziyi Wang

Multi-Agent Reinforcement Learning for Market Making: Competition without Collusion

Oct 29, 2025
Via arXiv

See, Think, Act: Online Shopper Behavior Simulation with VLM Agents

Oct 22, 2025
Via arXiv

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping

Oct 08, 2025
Via arXiv

SpikingMamba: Towards Energy-Efficient Large Language Models via Knowledge Distillation from Mamba

Oct 06, 2025
Via arXiv

CoT Vectors: Transferring and Probing the Reasoning Mechanisms of LLMs

Oct 01, 2025
Via arXiv

UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling

Aug 20, 2025
Via arXiv

Recognizing Actions from Robotic View for Natural Human-Robot Interaction

Jul 30, 2025
Via arXiv

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Jul 23, 2025
Via arXiv

PPAAS: PVT and Pareto Aware Analog Sizing via Goal-conditioned Reinforcement Learning

Jul 22, 2025
Via arXiv

Vision Generalist Model: A Survey

Jun 11, 2025
Via arXiv