Picture for Yihao Liu

Yihao Liu

Toward Generalizable Deblurring: Leveraging Massive Blur Priors with Linear Attention for Real-World Scenarios

Add code
Jan 10, 2026
Viaarxiv icon

Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding

Add code
Dec 25, 2025
Viaarxiv icon

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Add code
Dec 25, 2025
Viaarxiv icon

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Figure 1 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 2 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 3 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Figure 4 for dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
Viaarxiv icon

MetaVoxel: Joint Diffusion Modeling of Imaging and Clinical Metadata

Add code
Dec 12, 2025
Viaarxiv icon

SynWeather: Weather Observation Data Synthesis across Multiple Regions and Variables via a General Diffusion Transformer

Add code
Nov 15, 2025
Viaarxiv icon

Phenotype discovery of traumatic brain injury segmentations from heterogeneous multi-site data

Add code
Nov 05, 2025
Figure 1 for Phenotype discovery of traumatic brain injury segmentations from heterogeneous multi-site data
Figure 2 for Phenotype discovery of traumatic brain injury segmentations from heterogeneous multi-site data
Figure 3 for Phenotype discovery of traumatic brain injury segmentations from heterogeneous multi-site data
Figure 4 for Phenotype discovery of traumatic brain injury segmentations from heterogeneous multi-site data
Viaarxiv icon

Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval

Add code
Oct 31, 2025
Viaarxiv icon

Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism

Add code
Oct 30, 2025
Viaarxiv icon

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Add code
Oct 14, 2025
Figure 1 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 2 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 3 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 4 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Viaarxiv icon