Zhihui Zhu

Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations

Jun 10, 2025

On the Convergence of Gradient Descent on Learning Transformers with Residual Connections

Jun 05, 2025

Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models

May 22, 2025

From Compression to Expansion: A Layerwise Analysis of In-Context Learning

May 22, 2025

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

Feb 09, 2025

Optimal Error Analysis of Channel Estimation for IRS-assisted MIMO Systems

Dec 22, 2024

Analyzing and Improving Model Collapse in Rectified Flow Models

Dec 11, 2024

Optimal Allocation of Pauli Measurements for Low-rank Quantum State Tomography

Nov 07, 2024

Captions Speak Louder than Images (CASLIE): Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data

Oct 22, 2024

Robust Low-rank Tensor Train Recovery

Oct 19, 2024