Zhihui Zhu

From Emergence to Control: Probing and Modulating Self-Reflection in Language Models

Jun 13, 2025
Via arXiv

Understanding Task Vectors in In-Context Learning: Emergence, Functionality, and Limitations

Jun 10, 2025
Via arXiv

On the Convergence of Gradient Descent on Learning Transformers with Residual Connections

Jun 05, 2025
Via arXiv

Analyzing Fine-Grained Alignment and Enhancing Vision Understanding in Multimodal Language Models

May 22, 2025
Via arXiv

From Compression to Expansion: A Layerwise Analysis of In-Context Learning

May 22, 2025
Via arXiv

Understanding Representation Dynamics of Diffusion Models via Low-Dimensional Modeling

Feb 09, 2025
Via arXiv

Optimal Error Analysis of Channel Estimation for IRS-assisted MIMO Systems

Dec 22, 2024
Via arXiv

Analyzing and Improving Model Collapse in Rectified Flow Models

Dec 11, 2024
Via arXiv

Optimal Allocation of Pauli Measurements for Low-rank Quantum State Tomography

Nov 07, 2024
Via arXiv

Captions Speak Louder than Images (CASLIE): Generalizing Foundation Models for E-commerce from High-quality Multimodal Instruction Data

Oct 22, 2024
Via arXiv