Picture for Changsheng Zhao

Changsheng Zhao

Exploring Audio Hallucination in Egocentric Video Understanding

Add code
Apr 26, 2026
Viaarxiv icon

RPRA: Predicting an LLM-Judge for Efficient but Performant Inference

Add code
Apr 14, 2026
Viaarxiv icon

Neural Computers

Add code
Apr 07, 2026
Viaarxiv icon

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Add code
Mar 19, 2026
Viaarxiv icon

EgoAVU: Egocentric Audio-Visual Understanding

Add code
Feb 05, 2026
Viaarxiv icon

STEM: Scaling Transformers with Embedding Modules

Add code
Jan 15, 2026
Viaarxiv icon

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Add code
Jan 08, 2026
Viaarxiv icon

ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization

Add code
Feb 04, 2025
Viaarxiv icon

Llama Guard 3-1B-INT4: Compact and Efficient Safeguard for Human-AI Conversations

Add code
Nov 18, 2024
Viaarxiv icon

LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding

Add code
Oct 22, 2024
Figure 1 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 2 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 3 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Figure 4 for LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Viaarxiv icon