Picture for Yize Zhao

Yize Zhao

Structure Before Collapse: Transient semantic geometry in next-token prediction

Add code
Jun 25, 2026
Viaarxiv icon

Disentangling Latent Risk Pathways via Bayesian Hypergraph Inference

Add code
Jun 04, 2026
Viaarxiv icon

Workflow Closure Is Not Scientific Closure in Auto-Research Systems

Add code
May 25, 2026
Viaarxiv icon

Why Loss Re-weighting Works If You Stop Early: Training Dynamics of Unconstrained Features

Add code
Jan 17, 2026
Viaarxiv icon

How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data

Add code
Oct 27, 2025
Figure 1 for How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
Figure 2 for How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
Figure 3 for How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
Figure 4 for How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data
Viaarxiv icon

On the Geometry of Semantics in Next-token Prediction

Add code
May 13, 2025
Figure 1 for On the Geometry of Semantics in Next-token Prediction
Figure 2 for On the Geometry of Semantics in Next-token Prediction
Figure 3 for On the Geometry of Semantics in Next-token Prediction
Figure 4 for On the Geometry of Semantics in Next-token Prediction
Viaarxiv icon

DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models

Add code
Oct 12, 2024
Figure 1 for DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Figure 2 for DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Figure 3 for DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Figure 4 for DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
Viaarxiv icon

Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations

Add code
Aug 27, 2024
Figure 1 for Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Figure 2 for Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Figure 3 for Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Figure 4 for Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Viaarxiv icon

Learnable Community-Aware Transformer for Brain Connectome Analysis with Token Clustering

Add code
Mar 13, 2024
Figure 1 for Learnable Community-Aware Transformer for Brain Connectome Analysis with Token Clustering
Figure 2 for Learnable Community-Aware Transformer for Brain Connectome Analysis with Token Clustering
Figure 3 for Learnable Community-Aware Transformer for Brain Connectome Analysis with Token Clustering
Viaarxiv icon

Learning High-Order Relationships of Brain Regions

Add code
Dec 02, 2023
Figure 1 for Learning High-Order Relationships of Brain Regions
Figure 2 for Learning High-Order Relationships of Brain Regions
Figure 3 for Learning High-Order Relationships of Brain Regions
Figure 4 for Learning High-Order Relationships of Brain Regions
Viaarxiv icon