Picture for Kai Zhou

Kai Zhou

ViTexQA: A Multi-Frame Temporal Perception Dataset for Video Text Question Answering

Add code
Jun 23, 2026
Viaarxiv icon

REDI-Match: Rotation-Equivariant Distillation for Efficient and Robust Dense Matching

Add code
Jun 23, 2026
Viaarxiv icon

Toward Polymorphic Backdoor against Semantic Communication via Intensity-Based Poisoning

Add code
Apr 25, 2026
Viaarxiv icon

Instance-level Visual Active Tracking with Occlusion-Aware Planning

Add code
Apr 23, 2026
Viaarxiv icon

Training-Free Test-Time Contrastive Learning for Large Language Models

Add code
Apr 15, 2026
Viaarxiv icon

HandX: Scaling Bimanual Motion and Interaction Generation

Add code
Mar 30, 2026
Viaarxiv icon

ScoutAttention: Efficient KV Cache Offloading via Layer-Ahead CPU Pre-computation for LLM Inference

Add code
Mar 28, 2026
Viaarxiv icon

PositionOCR: Augmenting Positional Awareness in Multi-Modal Models via Hybrid Specialist Integration

Add code
Feb 22, 2026
Viaarxiv icon

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Add code
Feb 05, 2026
Viaarxiv icon

A Review of Machine Learning for Cavitation Intensity Recognition in Complex Industrial Systems

Add code
Nov 19, 2025
Viaarxiv icon