Yihua Shao

StyMam: A Mamba-Based Generator for Artistic Style Transfer

Jan 19, 2026

Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation

Jan 15, 2026

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

Dec 26, 2025

CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion

Oct 14, 2025

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Sep 17, 2025

AdsQA: Towards Advertisement Video Understanding

Sep 10, 2025

ICM-Fusion: In-Context Meta-Optimized LoRA Fusion for Multi-Task Adaptation

Aug 06, 2025

EventVAD: Training-Free Event-Aware Video Anomaly Detection

Apr 17, 2025

MambaIC: State Space Models for High-Performance Learned Image Compression

Mar 16, 2025

WonderVerse: Extendable 3D Scene Generation with Video Generative Models

Mar 13, 2025