Picture for Zixin Wen

Zixin Wen

Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment

Add code
Oct 28, 2024
Viaarxiv icon

Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling

Add code
Mar 04, 2024
Figure 1 for Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling
Figure 2 for Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling
Figure 3 for Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling
Figure 4 for Transformers Provably Learn Feature-Position Correlations in Masked Image Modeling
Viaarxiv icon

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

Add code
Mar 01, 2024
Figure 1 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 2 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 3 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Figure 4 for Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning
Viaarxiv icon

What Matters In The Structured Pruning of Generative Language Models?

Add code
Feb 07, 2023
Figure 1 for What Matters In The Structured Pruning of Generative Language Models?
Figure 2 for What Matters In The Structured Pruning of Generative Language Models?
Figure 3 for What Matters In The Structured Pruning of Generative Language Models?
Figure 4 for What Matters In The Structured Pruning of Generative Language Models?
Viaarxiv icon

The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning

Add code
May 14, 2022
Figure 1 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 2 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 3 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Figure 4 for The Mechanism of Prediction Head in Non-contrastive Self-supervised Learning
Viaarxiv icon

Improving Multi-Modal Learning with Uni-Modal Teachers

Add code
Jun 21, 2021
Figure 1 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 2 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 3 for Improving Multi-Modal Learning with Uni-Modal Teachers
Figure 4 for Improving Multi-Modal Learning with Uni-Modal Teachers
Viaarxiv icon

Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning

Add code
Jun 12, 2021
Figure 1 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 2 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 3 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Figure 4 for Toward Understanding the Feature Learning Process of Self-supervised Contrastive Learning
Viaarxiv icon

Convergence of End-to-End Training in Deep Unsupervised Contrasitive Learning

Add code
Feb 21, 2020
Viaarxiv icon