Picture for Jiahao Li

Jiahao Li

NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results

Add code
Apr 14, 2025
Viaarxiv icon

Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object Detection

Add code
Apr 06, 2025
Viaarxiv icon

A Survey on Unlearnable Data

Add code
Apr 01, 2025
Viaarxiv icon

FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model

Add code
Mar 25, 2025
Viaarxiv icon

DLF: Extreme Image Compression with Dual-generative Latent Fusion

Add code
Mar 03, 2025
Viaarxiv icon

Towards Practical Real-Time Neural Video Compression

Add code
Feb 28, 2025
Viaarxiv icon

Harnessing Discrete Differential Geometry: A Virtual Playground for the Bilayer Soft Robotics

Add code
Feb 02, 2025
Viaarxiv icon

EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents

Add code
Jan 21, 2025
Figure 1 for EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
Figure 2 for EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
Figure 3 for EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
Figure 4 for EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents
Viaarxiv icon

UI-TARS: Pioneering Automated GUI Interaction with Native Agents

Add code
Jan 21, 2025
Viaarxiv icon

BSDB-Net: Band-Split Dual-Branch Network with Selective State Spaces Mechanism for Monaural Speech Enhancement

Add code
Dec 26, 2024
Viaarxiv icon