Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Desigen: A Pipeline for Controllable Design Template Generation

Mar 14, 2024

Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation

Mar 13, 2024

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

Mar 13, 2024

Strength Lies in Differences! Towards Effective Non-collaborative Dialogues via Tailored Strategy Planning

Mar 11, 2024

OMH: Structured Sparsity via Optimally Matched Hierarchy for Unsupervised Semantic Segmentation

Mar 11, 2024

An Improved Analysis of Langevin Algorithms with Prior Diffusion for Non-Log-Concave Sampling

Mar 10, 2024

Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery

Mar 06, 2024

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Mar 06, 2024

Energy-Efficient UAV Swarm Assisted MEC with Dynamic Clustering and Scheduling

Feb 29, 2024

Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering

Feb 29, 2024