Picture for Zhedong Zheng

Zhedong Zheng

Uncertainty-o: One Model-agnostic Framework for Unveiling Uncertainty in Large Multimodal Models

Add code
Jun 09, 2025
Viaarxiv icon

Echo Planning for Autonomous Driving: From Current Observations to Future Trajectories and Back

Add code
May 25, 2025
Viaarxiv icon

CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval

Add code
Apr 26, 2025
Viaarxiv icon

Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generation

Add code
Mar 31, 2025
Viaarxiv icon

A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization

Add code
Dec 27, 2024
Viaarxiv icon

Relative Distance Guided Dynamic Partition Learning for Scale-Invariant UAV-View Geo-Localization

Add code
Dec 23, 2024
Viaarxiv icon

CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution

Add code
Dec 16, 2024
Viaarxiv icon

Near Large Far Small: Relative Distance Based Partition Learning for UAV-view Geo-Localization

Add code
Dec 16, 2024
Viaarxiv icon

Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark

Add code
Dec 03, 2024
Figure 1 for Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark
Figure 2 for Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark
Figure 3 for Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark
Figure 4 for Towards Rich Emotions in 3D Avatars: A Text-to-3D Avatar Generation Benchmark
Viaarxiv icon

RIGI: Rectifying Image-to-3D Generation Inconsistency via Uncertainty-aware Learning

Add code
Nov 28, 2024
Viaarxiv icon