Feng Zhao

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Jan 08, 2026
Via arXiv

From Sequential to Spatial: Reordering Autoregression for Efficient Visual Generation

Dec 31, 2025
Via arXiv

VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs

Dec 31, 2025
Via arXiv

VideoScaffold: Elastic-Scale Visual Hierarchies for Streaming Video Understanding in MLLMs

Dec 23, 2025
Via arXiv

MaskFocus: Focusing Policy Optimization on Critical Steps for Masked Image Generation

Dec 21, 2025
Via arXiv

Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation

Dec 09, 2025
Via arXiv

Fault2Flow: An AlphaEvolve-Optimized Human-in-the-Loop Multi-Agent System for Fault-to-Workflow Automation

Nov 17, 2025
Via arXiv

Frequency-Aware Vision-Language Multimodality Generalization Network for Remote Sensing Image Classification

Nov 13, 2025
Via arXiv

Group Critical-token Policy Optimization for Autoregressive Image Generation

Sep 26, 2025
Via arXiv

Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis

Jul 31, 2025
Via arXiv