Peisong Wang

Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization

Feb 07, 2026

SparVAR: Exploring Sparsity in Visual AutoRegressive Modeling for Training-Free Acceleration

Feb 04, 2026

DALI: A Workload-Aware Offloading Framework for Efficient MoE Inference on Local PCs

Feb 03, 2026

IntraSlice: Towards High-Performance Structural Pruning with Block-Intra PCA for LLMs

Feb 02, 2026

Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE

Feb 02, 2026

CoTBox-TTT: Grounding Medical VQA with Visual Chain-of-Thought Boxes During Test-time Training

Nov 16, 2025

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization

Nov 06, 2025

Block Rotation is All You Need for MXFP4 Quantization

Nov 06, 2025

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

May 01, 2025

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

Apr 27, 2025