Tonghan Wang

NaiAD: Initiate Data-Driven Research for LLM Advertising

May 11, 2026

How LLMs Are Persuaded: A Few Attention Heads, Rerouted

May 10, 2026

Incentive-Aware Multi-Fidelity Optimization for Generative Advertising in Large Language Models

Apr 07, 2026

LLM Active Alignment: A Nash Equilibrium Perspective

Feb 06, 2026

Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data

May 29, 2025

Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

May 27, 2025

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Feb 18, 2025

On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow

Oct 17, 2024

The Bandit Whisperer: Communication Learning for Restless Bandits

Aug 11, 2024

Principal-Agent Reinforcement Learning

Jul 25, 2024