Picture for Kai Wang

Kai Wang

Refer to the report for detailed contributions

ControlFoley: Unified and Controllable Video-to-Audio Generation with Cross-Modal Conflict Handling

Add code
Apr 16, 2026
Viaarxiv icon

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

Add code
Apr 13, 2026
Viaarxiv icon

Back to Basics: Let Conversational Agents Remember with Just Retrieval and Generation

Add code
Apr 13, 2026
Viaarxiv icon

Beyond Loss Values: Robust Dynamic Pruning via Loss Trajectory Alignment

Add code
Apr 08, 2026
Viaarxiv icon

OmniSonic: Towards Universal and Holistic Audio Generation from Video and Text

Add code
Apr 06, 2026
Viaarxiv icon

Adaptive Action Chunking at Inference-time for Vision-Language-Action Models

Add code
Apr 05, 2026
Viaarxiv icon

Beyond Logit Adjustment: A Residual Decomposition Framework for Long-Tailed Reranking

Add code
Apr 02, 2026
Viaarxiv icon

TIR-Agent: Training an Explorative and Efficient Agent for Image Restoration

Add code
Mar 29, 2026
Viaarxiv icon

Autonomous Agent-Orchestrated Digital Twins (AADT): Leveraging the OpenClaw Framework for State Synchronization in Rare Genetic Disorders

Add code
Mar 28, 2026
Viaarxiv icon

From Oracle to Noisy Context: Mitigating Contextual Exposure Bias in Speech-LLMs

Add code
Mar 25, 2026
Viaarxiv icon