Picture for Lei Wu

Lei Wu

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Add code
Nov 16, 2024
Viaarxiv icon

Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation

Add code
Oct 30, 2024
Viaarxiv icon

How Transformers Implement Induction Heads: Approximation and Optimization Analysis

Add code
Oct 15, 2024
Viaarxiv icon

DTactive: A Vision-Based Tactile Sensor with Active Surface

Add code
Oct 10, 2024
Viaarxiv icon

Why Do You Grok? A Theoretical Analysis of Grokking Modular Addition

Add code
Jul 17, 2024
Viaarxiv icon

Improving Generalization and Convergence by Enhancing Implicit Regularization

Add code
May 31, 2024
Viaarxiv icon

Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity

Add code
Apr 09, 2024
Figure 1 for Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity
Figure 2 for Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity
Figure 3 for Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity
Figure 4 for Exploring Neural Network Landscapes: Star-Shaped and Geodesic Connectivity
Viaarxiv icon

A Duality Analysis of Kernel Ridge Regression in the Noiseless Regime

Add code
Feb 24, 2024
Viaarxiv icon

The Implicit Bias of Gradient Noise: A Symmetry Perspective

Add code
Feb 11, 2024
Viaarxiv icon

Achieving Margin Maximization Exponentially Fast via Progressive Norm Rescaling

Add code
Dec 08, 2023
Viaarxiv icon