Picture for Jing Zhang

Jing Zhang

The University of Sydney, Australia

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Add code
Oct 03, 2024
Figure 1 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 2 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 3 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 4 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Viaarxiv icon

SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model

Add code
Oct 03, 2024
Figure 1 for SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Figure 2 for SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Figure 3 for SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Figure 4 for SoundMorpher: Perceptually-Uniform Sound Morphing with Diffusion Model
Viaarxiv icon

PCQPR: Proactive Conversational Question Planning with Reflection

Add code
Oct 02, 2024
Figure 1 for PCQPR: Proactive Conversational Question Planning with Reflection
Figure 2 for PCQPR: Proactive Conversational Question Planning with Reflection
Figure 3 for PCQPR: Proactive Conversational Question Planning with Reflection
Figure 4 for PCQPR: Proactive Conversational Question Planning with Reflection
Viaarxiv icon

Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation

Add code
Sep 29, 2024
Figure 1 for Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation
Figure 2 for Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation
Figure 3 for Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation
Figure 4 for Underwater Organism Color Enhancement via Color Code Decomposition, Adaptation and Interpolation
Viaarxiv icon

Evaluation of OpenAI o1: Opportunities and Challenges of AGI

Add code
Sep 27, 2024
Figure 1 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 2 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 3 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Figure 4 for Evaluation of OpenAI o1: Opportunities and Challenges of AGI
Viaarxiv icon

Few-Shot Class-Incremental Learning with Non-IID Decentralized Data

Add code
Sep 18, 2024
Figure 1 for Few-Shot Class-Incremental Learning with Non-IID Decentralized Data
Figure 2 for Few-Shot Class-Incremental Learning with Non-IID Decentralized Data
Figure 3 for Few-Shot Class-Incremental Learning with Non-IID Decentralized Data
Figure 4 for Few-Shot Class-Incremental Learning with Non-IID Decentralized Data
Viaarxiv icon

SoccerNet 2024 Challenges Results

Add code
Sep 16, 2024
Figure 1 for SoccerNet 2024 Challenges Results
Figure 2 for SoccerNet 2024 Challenges Results
Figure 3 for SoccerNet 2024 Challenges Results
Figure 4 for SoccerNet 2024 Challenges Results
Viaarxiv icon

GP-GPT: Large Language Model for Gene-Phenotype Mapping

Add code
Sep 15, 2024
Figure 1 for GP-GPT: Large Language Model for Gene-Phenotype Mapping
Figure 2 for GP-GPT: Large Language Model for Gene-Phenotype Mapping
Figure 3 for GP-GPT: Large Language Model for Gene-Phenotype Mapping
Figure 4 for GP-GPT: Large Language Model for Gene-Phenotype Mapping
Viaarxiv icon

Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark

Add code
Sep 13, 2024
Figure 1 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 2 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 3 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Viaarxiv icon

Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding

Add code
Sep 12, 2024
Figure 1 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 2 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 3 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Figure 4 for Dynamic Prompting of Frozen Text-to-Image Diffusion Models for Panoptic Narrative Grounding
Viaarxiv icon