Long Lan

Towards Efficient Partially Relevant Video Retrieval with Active Moment Discovering

Apr 15, 2025
Via arXiv

JointTuner: Appearance-Motion Adaptive Joint Training for Customized Video Generation

Mar 31, 2025
Via arXiv

XLRS-Bench: Could Your Multimodal LLMs Understand Extremely Large Ultra-High-Resolution Remote Sensing Imagery?

Mar 31, 2025
Via arXiv

Let Synthetic Data Shine: Domain Reassembly and Soft-Fusion for Single Domain Generalization

Mar 17, 2025
Via arXiv

RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing

Mar 13, 2025
Via arXiv

MagicNaming: Consistent Identity Generation by Finding a "Name Space" in T2I Diffusion Models

Dec 19, 2024
Via arXiv

Object Style Diffusion for Generalized Object Detection in Urban Scene

Dec 18, 2024
Via arXiv

Relieving Universal Label Noise for Unsupervised Visible-Infrared Person Re-Identification by Inferring from Neighbors

Dec 16, 2024
Via arXiv

Temporal-Enhanced Multimodal Transformer for Referring Multi-Object Tracking and Segmentation

Oct 17, 2024
Via arXiv

MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model

Aug 17, 2024
Via arXiv