Picture for Yukai Huang

Yukai Huang

Color When It Counts: Grayscale-Guided Online Triggering for Always-On Streaming Video Sensing

Add code
Mar 23, 2026
Viaarxiv icon

Egocentric Co-Pilot: Web-Native Smart-Glasses Agents for Assistive Egocentric AI

Add code
Mar 01, 2026
Viaarxiv icon

EgoGraph: Temporal Knowledge Graph for Egocentric Video Understanding

Add code
Feb 27, 2026
Viaarxiv icon

SecCodeBench-V2 Technical Report

Add code
Feb 17, 2026
Viaarxiv icon

Optimizing Multimodal LLMs for Egocentric Video Understanding: A Solution for the HD-EPIC VQA Challenge

Add code
Jan 15, 2026
Viaarxiv icon

Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation

Add code
Nov 12, 2025
Figure 1 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 2 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 3 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Figure 4 for Plug-and-Play Clarifier: A Zero-Shot Multimodal Framework for Egocentric Intent Disambiguation
Viaarxiv icon

Dolphin: A Large-Scale Automatic Speech Recognition Model for Eastern Languages

Add code
Mar 26, 2025
Viaarxiv icon

speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment

Add code
Apr 03, 2021
Figure 1 for speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
Figure 2 for speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
Figure 3 for speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
Figure 4 for speechocean762: An Open-Source Non-native English Speech Corpus For Pronunciation Assessment
Viaarxiv icon