Picture for Zixuan Dong

Zixuan Dong

EffiQA: Efficient Question-Answering with Strategic Multi-Model Collaboration on Knowledge Graphs

Add code
Jun 03, 2024
Viaarxiv icon

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Add code
Feb 07, 2024
Viaarxiv icon

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

Add code
Oct 06, 2023
Figure 1 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 2 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 3 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Figure 4 for Pre-training with Synthetic Data Helps Offline Reinforcement Learning
Viaarxiv icon

On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs

Add code
Sep 07, 2022
Figure 1 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 2 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 3 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 4 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Viaarxiv icon