Picture for Zhuoran Yang

Zhuoran Yang

Decoding Rewards in Competitive Games: Inverse Game Theory with Entropy Regularization

Add code
Jan 19, 2026
Viaarxiv icon

Demystifying the Slash Pattern in Attention: The Role of RoPE

Add code
Jan 13, 2026
Viaarxiv icon

Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning

Add code
Oct 15, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

The Sample Complexity of Online Strategic Decision Making with Information Asymmetry and Knowledge Transportability

Add code
Jun 11, 2025
Viaarxiv icon

Learning to Lead: Incentivizing Strategic Agents in the Dark

Add code
Jun 10, 2025
Viaarxiv icon

Quantile-Optimal Policy Learning under Unmeasured Confounding

Add code
Jun 08, 2025
Viaarxiv icon

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms

Add code
May 21, 2025
Figure 1 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 2 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 3 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Figure 4 for BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms
Viaarxiv icon

Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving

Add code
Apr 17, 2025
Figure 1 for Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Figure 2 for Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Figure 3 for Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Figure 4 for Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving
Viaarxiv icon

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Add code
Mar 17, 2025
Figure 1 for In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Figure 2 for In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Figure 3 for In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Figure 4 for In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention
Viaarxiv icon