Zhaochong An

Video Understanding: From Geometry and Semantics to Unified Models

Mar 18, 2026
Via arXiv

Revisiting the Perception-Distortion Trade-off with Spatial-Semantic Guided Super-Resolution

Mar 14, 2026
Via arXiv

VecGlypher: Unified Vector Glyph Generation with Language Models

Feb 25, 2026
Via arXiv

Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning

Jan 28, 2026
Via arXiv

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Dec 24, 2025
Via arXiv

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Dec 08, 2025
Via arXiv

Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory

May 28, 2025
Via arXiv

Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model

Mar 20, 2025
Via arXiv

Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation

Oct 29, 2024
Via arXiv

DreamBeast: Distilling 3D Fantastical Animals with Part-Aware Knowledge Transfer

Sep 12, 2024
Via arXiv