Picture for Yi-Hsuan Tsai

Yi-Hsuan Tsai

Beyond Words: Multimodal LLM Knows When to Speak

Add code
May 20, 2025
Viaarxiv icon

Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting

Add code
Apr 03, 2025
Viaarxiv icon

uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images

Add code
Mar 27, 2025
Viaarxiv icon

What Changed and What Could Have Changed? State-Change Counterfactuals for Procedure-Aware Video Representation Learning

Add code
Mar 27, 2025
Viaarxiv icon

Exemplar Masking for Multimodal Incremental Learning

Add code
Dec 12, 2024
Viaarxiv icon

Ranking-aware adapter for text-driven image ordering with CLIP

Add code
Dec 09, 2024
Viaarxiv icon

Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models

Add code
Dec 09, 2024
Viaarxiv icon

Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation

Add code
Sep 29, 2024
Figure 1 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 2 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 3 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Figure 4 for Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation
Viaarxiv icon

Self-training Room Layout Estimation via Geometry-aware Ray-casting

Add code
Jul 21, 2024
Figure 1 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 2 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 3 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Figure 4 for Self-training Room Layout Estimation via Geometry-aware Ray-casting
Viaarxiv icon

Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts

Add code
Jul 10, 2024
Figure 1 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 2 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 3 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Figure 4 for Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts
Viaarxiv icon