Picture for Zhenmei Shi

Zhenmei Shi

Differential Privacy Mechanisms in Neural Tangent Kernel Regression

Add code
Jul 18, 2024
Viaarxiv icon

Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models

Add code
Jun 21, 2024
Figure 1 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 2 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 3 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Figure 4 for Is A Picture Worth A Thousand Words? Delving Into Spatial Reasoning for Vision Language Models
Viaarxiv icon

Toward Infinite-Long Prefix in Transformer

Add code
Jun 20, 2024
Viaarxiv icon

Why Larger Language Models Do In-context Learning Differently?

Add code
May 30, 2024
Viaarxiv icon

Tensor Attention Training: Provably Efficient Learning of Higher-order Transformers

Add code
May 26, 2024
Viaarxiv icon

Unraveling the Smoothness Properties of Diffusion Models: A Gaussian Mixture Perspective

Add code
May 26, 2024
Viaarxiv icon

Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers

Add code
May 08, 2024
Figure 1 for Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers
Figure 2 for Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers
Figure 3 for Conv-Basis: A New Paradigm for Efficient Attention Inference and Gradient Computation in Transformers
Viaarxiv icon

Exploring the Frontiers of Softmax: Provable Optimization, Applications in Diffusion Model, and Beyond

Add code
May 06, 2024
Viaarxiv icon

Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning

Add code
Feb 22, 2024
Figure 1 for Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning
Figure 2 for Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning
Figure 3 for Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning
Figure 4 for Towards Few-Shot Adaptation of Foundation Models via Multitask Finetuning
Viaarxiv icon

Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic

Add code
Feb 12, 2024
Figure 1 for Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic
Figure 2 for Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic
Figure 3 for Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic
Figure 4 for Fourier Circuits in Neural Networks: Unlocking the Potential of Large Language Models in Mathematical Reasoning and Modular Arithmetic
Viaarxiv icon