Xudong Lu

School of Biomedical Engineering and Instrumental Science, Zhejiang University, Hangzhou, P.R. China; School of Industrial Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands

GLEAM: Learning to Match and Explain in Cross-View Geo-Localization

Sep 09, 2025

Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding

May 08, 2025

SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Mar 08, 2025

GenieBlue: Integrating both Linguistic and Multimodal Capabilities for Large Language Models on Mobile Devices

Mar 08, 2025

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach

Mar 05, 2025

PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

Jan 23, 2025

CodeV: Issue Resolving with Visual Data

Dec 23, 2024

BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices

Nov 16, 2024

ThinK: Thinner Key Cache by Query-Driven Pruning

Jul 30, 2024

FabGPT: An Efficient Large Multimodal Model for Complex Wafer Defect Knowledge Queries

Jul 15, 2024