Ting Cao

Microsoft Research

GRIP-VLM: Group-Relative Importance Pruning for Efficient Vision-Language Models

May 13, 2026

EmbodiSkill: Skill-Aware Reflection for Self-Evolving Embodied Agents

May 11, 2026

Tessera: Unlocking Heterogeneous GPUs through Kernel-Granularity Disaggregation

Apr 11, 2026

Learning to Commit: Generating Organic Pull Requests via Online Repository Memory

Mar 27, 2026

Em-Garde: A Propose-Match Framework for Proactive Streaming Video Understanding

Mar 19, 2026

OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism

Mar 15, 2026

Sculptor: Empowering LLMs with Cognitive Agency via Active Context Management

Aug 06, 2025

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Jun 10, 2025

SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale

May 26, 2025

Empowering Agentic Video Analytics Systems with Video Language Models

May 02, 2025