Bin Luo

Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception

Mar 05, 2024
Via arXiv

Source-free Domain Adaptive Object Detection in Remote Sensing Images

Jan 31, 2024
Via arXiv

WordArt Designer API: User-Driven Artistic Typography Synthesis with Large Language Models on ModelScope

Jan 12, 2024
Via arXiv

Unifying Graph Contrastive Learning via Graph Message Augmentation

Jan 08, 2024
Via arXiv

Transformer RGBT Tracking with Spatio-Temporal Multimodal Tokens

Jan 03, 2024
Via arXiv

Tracking with Human-Intent Reasoning

Dec 29, 2023
Via arXiv

A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks

Dec 25, 2023
Via arXiv

Nighttime Person Re-Identification via Collaborative Enhancement Network with Multi-domain Learning

Dec 25, 2023
Via arXiv

Modality-missing RGBT Tracking via Invertible Prompt Learning and A High-quality Data Simulation Method

Dec 25, 2023
Via arXiv

Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models

Nov 30, 2023
Via arXiv