Weijia Li

OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

May 29, 2025
Via arXiv

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

May 25, 2025
Via arXiv

Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind

May 18, 2025
Via arXiv

LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection

Apr 17, 2025
Via arXiv

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Apr 03, 2025
Via arXiv

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration

Apr 01, 2025
Via arXiv

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Mar 19, 2025
Via arXiv

Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation

Mar 19, 2025
Via arXiv

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Feb 17, 2025
Via arXiv

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Feb 17, 2025
Via arXiv