Picture for Chen Zhang

Chen Zhang

SenseTime Research

PruneTIR: Inference-Time Tool Call Pruning for Effective yet Efficient Tool-Integrated Reasoning

Add code
May 11, 2026
Viaarxiv icon

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

Add code
Apr 24, 2026
Viaarxiv icon

TriEx: A Game-based Tri-View Framework for Explaining Internal Reasoning in Multi-Agent LLMs

Add code
Apr 21, 2026
Viaarxiv icon

Efficient Low-Resource Language Adaptation via Multi-Source Dynamic Logit Fusion

Add code
Apr 20, 2026
Viaarxiv icon

A-IO: Adaptive Inference Orchestration for Memory-Bound NPUs

Add code
Apr 10, 2026
Viaarxiv icon

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents

Add code
Apr 08, 2026
Viaarxiv icon

An Empirical Study of Many-Shot In-Context Learning for Machine Translation of Low-Resource Languages

Add code
Apr 03, 2026
Viaarxiv icon

VecAttention: Vector-wise Sparse Attention for Accelerating Long Context Inference

Add code
Mar 31, 2026
Viaarxiv icon

WAter: A Workload-Adaptive Knob Tuning System based on Workload Compression

Add code
Mar 28, 2026
Viaarxiv icon

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

Add code
Mar 27, 2026
Viaarxiv icon