Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Provable Particle-based Primal-Dual Algorithm for Mixed Nash Equilibrium

Add code
Mar 02, 2023
Viaarxiv icon

Active Prompting with Chain-of-Thought for Large Language Models

Add code
Feb 26, 2023
Figure 1 for Active Prompting with Chain-of-Thought for Large Language Models
Figure 2 for Active Prompting with Chain-of-Thought for Large Language Models
Figure 3 for Active Prompting with Chain-of-Thought for Large Language Models
Figure 4 for Active Prompting with Chain-of-Thought for Large Language Models
Viaarxiv icon

Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data

Add code
Feb 24, 2023
Figure 1 for Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Figure 2 for Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Figure 3 for Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Figure 4 for Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
Viaarxiv icon

A Heuristic Autonomous Exploration Method Based on Environmental Information Gain During Quadrotor Flight

Add code
Feb 21, 2023
Figure 1 for A Heuristic Autonomous Exploration Method Based on Environmental Information Gain During Quadrotor Flight
Figure 2 for A Heuristic Autonomous Exploration Method Based on Environmental Information Gain During Quadrotor Flight
Figure 3 for A Heuristic Autonomous Exploration Method Based on Environmental Information Gain During Quadrotor Flight
Figure 4 for A Heuristic Autonomous Exploration Method Based on Environmental Information Gain During Quadrotor Flight
Viaarxiv icon

Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

Add code
Feb 21, 2023
Figure 1 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Figure 2 for Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency
Viaarxiv icon

Hashtag-Guided Low-Resource Tweet Classification

Add code
Feb 20, 2023
Viaarxiv icon

On the Convergence of Federated Averaging with Cyclic Client Participation

Add code
Feb 06, 2023
Figure 1 for On the Convergence of Federated Averaging with Cyclic Client Participation
Figure 2 for On the Convergence of Federated Averaging with Cyclic Client Participation
Figure 3 for On the Convergence of Federated Averaging with Cyclic Client Participation
Figure 4 for On the Convergence of Federated Averaging with Cyclic Client Participation
Viaarxiv icon

Learning in POMDPs is Sample-Efficient with Hindsight Observability

Add code
Feb 03, 2023
Viaarxiv icon

History-Aware Hierarchical Transformer for Multi-session Open-domain Dialogue System

Add code
Feb 02, 2023
Viaarxiv icon

ADAPT: Action-aware Driving Caption Transformer

Add code
Feb 01, 2023
Figure 1 for ADAPT: Action-aware Driving Caption Transformer
Figure 2 for ADAPT: Action-aware Driving Caption Transformer
Figure 3 for ADAPT: Action-aware Driving Caption Transformer
Figure 4 for ADAPT: Action-aware Driving Caption Transformer
Viaarxiv icon