Picture for Zhaozhuo Xu

Zhaozhuo Xu

Stevens Institute of Technology

The Diminishing Returns of Early-Exit Decoding in Modern LLMs

Add code
Mar 24, 2026
Viaarxiv icon

A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs

Add code
Feb 23, 2026
Viaarxiv icon

Scout Before You Attend: Sketch-and-Walk Sparse Attention for Efficient LLM Inference

Add code
Feb 07, 2026
Viaarxiv icon

Copyright Detective: A Forensic System to Evidence LLMs Flickering Copyright Leakage Risks

Add code
Feb 05, 2026
Viaarxiv icon

DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic

Add code
May 22, 2025
Viaarxiv icon

VTBench: Evaluating Visual Tokenizers for Autoregressive Image Generation

Add code
May 19, 2025
Viaarxiv icon

Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models

Add code
Apr 05, 2025
Figure 1 for Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
Figure 2 for Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
Figure 3 for Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
Figure 4 for Sensitivity Meets Sparsity: The Impact of Extremely Sparse Parameter Patterns on Theory-of-Mind of Large Language Models
Viaarxiv icon

ALinFiK: Learning to Approximate Linearized Future Influence Kernel for Scalable Third-Parity LLM Data Valuation

Add code
Mar 02, 2025
Viaarxiv icon

Fox-1 Technical Report

Add code
Nov 08, 2024
Figure 1 for Fox-1 Technical Report
Figure 2 for Fox-1 Technical Report
Figure 3 for Fox-1 Technical Report
Figure 4 for Fox-1 Technical Report
Viaarxiv icon

Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs

Add code
Nov 07, 2024
Figure 1 for Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
Figure 2 for Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
Figure 3 for Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
Figure 4 for Alopex: A Computational Framework for Enabling On-Device Function Calls with LLMs
Viaarxiv icon