Yunpu Ma

miniReranker: Efficient Multimodal Reranking through Visual Cache Reuse and Interaction Sparsity

Jun 09, 2026
Via arXiv

Beyond FLOPs: Benchmarking Real Inference Acceleration of LLM Pruning under a GEMM-Centric Taxonomy

Jun 08, 2026
Via arXiv

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

May 30, 2026
Via arXiv

EchoRL: Reinforcement Learning via Rollout Echoing

May 29, 2026
Via arXiv

Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

May 20, 2026
Via arXiv

Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation

Apr 13, 2026
Via arXiv

Routing-Free Mixture-of-Experts

Apr 01, 2026
Via arXiv

Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models

Mar 03, 2026
Via arXiv

UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking

Feb 27, 2026
Via arXiv

HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit

Feb 27, 2026
Via arXiv