Zhiyuan Zhao

IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework

Mar 10, 2026

UNICBench: UNIfied Counting Benchmark for MLLM

Feb 28, 2026

AHAP: Reconstructing Arbitrary Humans from Arbitrary Perspectives with Geometric Priors

Feb 27, 2026

Do MLLMs Really See It: Reinforcing Visual Attention in Multimodal LLMs

Feb 09, 2026

ChatUMM: Robust Context Tracking for Conversational Interleaved Generation

Feb 06, 2026

LEMAS: A 150K-Hour Large-scale Extensible Multilingual Audio Suite with Generative Speech Models

Jan 04, 2026

FusAD: Time-Frequency Fusion with Adaptive Denoising for General Time Series Analysis

Dec 16, 2025

Modular Deep-Learning-Based Early Warning System for Deadly Heatwave Prediction

Dec 09, 2025

Exploring the Underwater World Segmentation without Extra Training

Nov 11, 2025

OmniLayout: Enabling Coarse-to-Fine Learning with LLMs for Universal Document Layout Generation

Oct 30, 2025