Picture for Junyan Ye

Junyan Ye

UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective

Add code
Sep 26, 2025
Figure 1 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 2 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 3 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Figure 4 for UrbanFeel: A Comprehensive Benchmark for Temporal and Perceptual Understanding of City Scenes through Human Perspective
Viaarxiv icon

Can Understanding and Generation Truly Benefit Together -- or Just Coexist?

Add code
Sep 11, 2025
Viaarxiv icon

Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation

Add code
Aug 13, 2025
Viaarxiv icon

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Add code
Apr 03, 2025
Figure 1 for GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
Figure 2 for GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
Figure 3 for GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
Figure 4 for GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation
Viaarxiv icon

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration

Add code
Apr 01, 2025
Viaarxiv icon

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Add code
Mar 19, 2025
Viaarxiv icon

Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation

Add code
Mar 19, 2025
Viaarxiv icon

Where am I? Cross-View Geo-localization with Natural Language Descriptions

Add code
Dec 22, 2024
Viaarxiv icon

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Add code
Oct 13, 2024
Figure 1 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 2 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 3 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Figure 4 for LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Viaarxiv icon

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Add code
Aug 30, 2024
Viaarxiv icon