Picture for Zixiang Chen

Zixiang Chen

Fast Sampling via De-randomization for Discrete Diffusion Models

Add code
Dec 14, 2023
Figure 1 for Fast Sampling via De-randomization for Discrete Diffusion Models
Figure 2 for Fast Sampling via De-randomization for Discrete Diffusion Models
Figure 3 for Fast Sampling via De-randomization for Discrete Diffusion Models
Figure 4 for Fast Sampling via De-randomization for Discrete Diffusion Models
Viaarxiv icon

Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves

Add code
Nov 07, 2023
Figure 1 for Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Figure 2 for Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Figure 3 for Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Figure 4 for Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves
Viaarxiv icon

Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

Add code
Oct 29, 2023
Figure 1 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 2 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 3 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Figure 4 for Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data
Viaarxiv icon

How Many Pretraining Tasks Are Needed for In-Context Learning of Linear Regression?

Add code
Oct 12, 2023
Viaarxiv icon

Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

Add code
Oct 11, 2023
Figure 1 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 2 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 3 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Figure 4 for Why Does Sharpness-Aware Minimization Generalize Better Than SGD?
Viaarxiv icon

Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP

Add code
Oct 02, 2023
Viaarxiv icon

Benign Overfitting for Two-layer ReLU Networks

Add code
Mar 07, 2023
Viaarxiv icon

Learning High-Dimensional Single-Neuron ReLU Networks with Finite Samples

Add code
Mar 03, 2023
Viaarxiv icon

ISA-Net: Improved spatial attention network for PET-CT tumor segmentation

Add code
Nov 04, 2022
Figure 1 for ISA-Net: Improved spatial attention network for PET-CT tumor segmentation
Figure 2 for ISA-Net: Improved spatial attention network for PET-CT tumor segmentation
Figure 3 for ISA-Net: Improved spatial attention network for PET-CT tumor segmentation
Figure 4 for ISA-Net: Improved spatial attention network for PET-CT tumor segmentation
Viaarxiv icon

A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning

Add code
Sep 30, 2022
Figure 1 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Figure 2 for A General Framework for Sample-Efficient Function Approximation in Reinforcement Learning
Viaarxiv icon