Tonghan Wang

Duality for Optimal Multi-Item, Multi-Bidder Auction Design: Revenue Certificates through Deep Learning

Jun 08, 2026
Via arXiv

NaiAD: Initiate Data-Driven Research for LLM Advertising

May 11, 2026
Via arXiv

How LLMs Are Persuaded: A Few Attention Heads, Rerouted

May 10, 2026
Via arXiv

Incentive-Aware Multi-Fidelity Optimization for Generative Advertising in Large Language Models

Apr 07, 2026
Via arXiv

LLM Active Alignment: A Nash Equilibrium Perspective

Feb 06, 2026
Via arXiv

Composite Flow Matching for Reinforcement Learning with Shifted-Dynamics Data

May 29, 2025
Via arXiv

Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

May 27, 2025
Via arXiv

Policy-to-Language: Train LLMs to Explain Decisions with Flow-Matching Generated Rewards

Feb 18, 2025
Via arXiv

On Diffusion Models for Multi-Agent Partial Observability: Shared Attractors, Error Bounds, and Composite Flow

Oct 17, 2024
Via arXiv

The Bandit Whisperer: Communication Learning for Restless Bandits

Aug 11, 2024
Via arXiv