Picture for Tian Wang

Tian Wang

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Add code
Jul 23, 2025
Figure 1 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 2 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 3 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 4 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Viaarxiv icon

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Add code
Jun 05, 2025
Figure 1 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 2 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 3 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Figure 4 for OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation
Viaarxiv icon

Hybrid Learning for Cold-Start-Aware Microservice Scheduling in Dynamic Edge Environments

Add code
May 28, 2025
Viaarxiv icon

FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation

Add code
May 24, 2025
Figure 1 for FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation
Figure 2 for FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation
Figure 3 for FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation
Figure 4 for FedHL: Federated Learning for Heterogeneous Low-Rank Adaptation via Unbiased Aggregation
Viaarxiv icon

AFCL: Analytic Federated Continual Learning for Spatio-Temporal Invariance of Non-IID Data

Add code
May 18, 2025
Viaarxiv icon

InfoPO: On Mutual Information Maximization for Large Language Model Alignment

Add code
May 13, 2025
Viaarxiv icon

Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning

Add code
Mar 31, 2025
Figure 1 for Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
Figure 2 for Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
Figure 3 for Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
Figure 4 for Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning
Viaarxiv icon

Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models

Add code
Mar 08, 2025
Figure 1 for Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Figure 2 for Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Figure 3 for Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Figure 4 for Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Viaarxiv icon

STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection

Add code
Dec 28, 2024
Viaarxiv icon

Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks

Add code
Dec 24, 2024
Figure 1 for Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
Figure 2 for Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
Figure 3 for Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
Figure 4 for Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks
Viaarxiv icon