Picture for Kaiqi Huang

Kaiqi Huang

How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking

Add code
Nov 23, 2024
Figure 1 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 2 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 3 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 4 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Viaarxiv icon

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Add code
Oct 03, 2024
Figure 1 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 2 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 3 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 4 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Viaarxiv icon

Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark

Add code
Sep 13, 2024
Figure 1 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 2 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 3 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Viaarxiv icon

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Add code
Jul 12, 2024
Figure 1 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 2 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 3 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 4 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Viaarxiv icon

SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling

Add code
May 21, 2024
Figure 1 for SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Figure 2 for SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Figure 3 for SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Figure 4 for SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling
Viaarxiv icon

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Add code
May 20, 2024
Viaarxiv icon

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Add code
Mar 16, 2024
Figure 1 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 2 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 3 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 4 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Viaarxiv icon

TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

Add code
Jan 15, 2024
Figure 1 for TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Figure 2 for TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Figure 3 for TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Figure 4 for TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient
Viaarxiv icon

Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models

Add code
Jan 15, 2024
Figure 1 for Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Figure 2 for Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Figure 3 for Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Figure 4 for Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models
Viaarxiv icon

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

Add code
Oct 09, 2023
Viaarxiv icon