Picture for Da Li

Da Li

MERGETUNE: Continued fine-tuning of vision-language models

Add code
Jan 16, 2026
Viaarxiv icon

OMG-Bench: A New Challenging Benchmark for Skeleton-based Online Micro Hand Gesture Recognition

Add code
Dec 18, 2025
Viaarxiv icon

Run, Ruminate, and Regulate: A Dual-process Thinking System for Vision-and-Language Navigation

Add code
Nov 18, 2025
Viaarxiv icon

One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow

Add code
Nov 17, 2025
Viaarxiv icon

Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding

Add code
Nov 11, 2025
Viaarxiv icon

SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models

Add code
Aug 10, 2025
Figure 1 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 2 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 3 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Figure 4 for SketchAnimator: Animate Sketch via Motion Customization of Text-to-Video Diffusion Models
Viaarxiv icon

HierarchicalPrune: Position-Aware Compression for Large-Scale Diffusion Models

Add code
Aug 06, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

A Survey of Link Prediction in N-ary Knowledge Graphs

Add code
Jun 10, 2025
Figure 1 for A Survey of Link Prediction in N-ary Knowledge Graphs
Figure 2 for A Survey of Link Prediction in N-ary Knowledge Graphs
Figure 3 for A Survey of Link Prediction in N-ary Knowledge Graphs
Figure 4 for A Survey of Link Prediction in N-ary Knowledge Graphs
Viaarxiv icon

LifeIR at the NTCIR-18 Lifelog-6 Task

Add code
May 27, 2025
Viaarxiv icon