Kim-Hui Yap

NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results

Apr 09, 2026
Via arXiv

Mixture-of-Modality-Experts with Holistic Token Learning for Fine-Grained Multimodal Visual Analytics in Driver Action Recognition

Apr 07, 2026
Via arXiv

Accelerating Diffusion-based Video Editing via Heterogeneous Caching: Beyond Full Computing at Sampled Denoising Timestep

Mar 25, 2026
Via arXiv

DAOS: A Multimodal In-cabin Behavior Monitoring with Driver Action-Object Synergy Dataset

Jan 17, 2026
Via arXiv

MuseAgent-1: Interactive Grounded Multimodal Understanding of Music Scores and Performance Audio

Jan 17, 2026
Via arXiv

Building Egocentric Procedural AI Assistant: Methods, Benchmarks, and Challenges

Nov 17, 2025
Via arXiv

Towards Blind Bitstream-corrupted Video Recovery via a Visual Foundation Model-driven Framework

Jul 30, 2025
Via arXiv

SSH-Net: A Self-Supervised and Hybrid Network for Noisy Image Watermark Removal

May 08, 2025
Via arXiv

ByteNet: Rethinking Multimedia File Fragment Classification through Visual Perspectives

Oct 28, 2024
Via arXiv

CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models

Oct 21, 2024
Via arXiv