Picture for Zheng Zhang

Zheng Zhang

Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, School of Artificial Intelligence, Beihang University, China

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Add code
Mar 18, 2025
Figure 1 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 2 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 3 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Figure 4 for DAPO: An Open-Source LLM Reinforcement Learning System at Scale
Viaarxiv icon

TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting

Add code
Mar 11, 2025
Figure 1 for TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting
Figure 2 for TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting
Figure 3 for TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting
Figure 4 for TT-GaussOcc: Test-Time Compute for Self-Supervised Occupancy Prediction via Spatio-Temporal Gaussian Splatting
Viaarxiv icon

Wanda++: Pruning Large Language Models via Regional Gradients

Add code
Mar 06, 2025
Viaarxiv icon

Muon is Scalable for LLM Training

Add code
Feb 24, 2025
Viaarxiv icon

MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models

Add code
Feb 17, 2025
Figure 1 for MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
Figure 2 for MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
Figure 3 for MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
Figure 4 for MaZO: Masked Zeroth-Order Optimization for Multi-Task Fine-Tuning of Large Language Models
Viaarxiv icon

Connector-S: A Survey of Connectors in Multi-modal Large Language Models

Add code
Feb 17, 2025
Figure 1 for Connector-S: A Survey of Connectors in Multi-modal Large Language Models
Figure 2 for Connector-S: A Survey of Connectors in Multi-modal Large Language Models
Figure 3 for Connector-S: A Survey of Connectors in Multi-modal Large Language Models
Figure 4 for Connector-S: A Survey of Connectors in Multi-modal Large Language Models
Viaarxiv icon

QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models

Add code
Feb 17, 2025
Figure 1 for QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
Figure 2 for QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
Figure 3 for QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
Figure 4 for QuZO: Quantized Zeroth-Order Fine-Tuning for Large Language Models
Viaarxiv icon

Scalable Back-Propagation-Free Training of Optical Physics-Informed Neural Networks

Add code
Feb 17, 2025
Viaarxiv icon

DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification

Add code
Feb 04, 2025
Figure 1 for DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification
Figure 2 for DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification
Figure 3 for DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification
Figure 4 for DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification
Viaarxiv icon

PolaFormer: Polarity-aware Linear Attention for Vision Transformers

Add code
Jan 25, 2025
Viaarxiv icon