Picture for Yu-Chiang Frank Wang

Yu-Chiang Frank Wang

Frequency Switching Mechanism for Parameter-E!cient Multi-Task Learning

Add code
Mar 22, 2026
Viaarxiv icon

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

Add code
Mar 19, 2026
Viaarxiv icon

VISTA: Validation-Guided Integration of Spatial and Temporal Foundation Models with Anatomical Decoding for Rare-Pathology VCE Event Detection

Add code
Mar 18, 2026
Viaarxiv icon

Advancing Structured Priors for Sparse-Voxel Surface Reconstruction

Add code
Jan 25, 2026
Viaarxiv icon

MV-SAM: Multi-view Promptable Segmentation using Pointmap Guidance

Add code
Jan 25, 2026
Viaarxiv icon

GaussExplorer: 3D Gaussian Splatting for Embodied Exploration and Reasoning

Add code
Jan 19, 2026
Viaarxiv icon

OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding

Add code
Jan 14, 2026
Viaarxiv icon

Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception

Add code
Jan 14, 2026
Viaarxiv icon

Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning

Add code
Jan 14, 2026
Viaarxiv icon

TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors

Add code
Jan 06, 2026
Viaarxiv icon