Picture for Xiaokun Feng

Xiaokun Feng

Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation

Add code
Aug 12, 2025
Viaarxiv icon

VMBench: A Benchmark for Perception-Aligned Video Motion Generation

Add code
Mar 13, 2025
Viaarxiv icon

How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking

Add code
Nov 23, 2024
Figure 1 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 2 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 3 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Figure 4 for How Texts Help? A Fine-grained Evaluation to Reveal the Role of Language in Vision-Language Tracking
Viaarxiv icon

DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM

Add code
Oct 03, 2024
Figure 1 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 2 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 3 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Figure 4 for DTVLT: A Multi-modal Diverse Text Benchmark for Visual Language Tracking Based on LLM
Viaarxiv icon

Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark

Add code
Sep 13, 2024
Figure 1 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 2 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Figure 3 for Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark
Viaarxiv icon

Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness

Add code
Jul 12, 2024
Figure 1 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 2 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 3 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Figure 4 for Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Viaarxiv icon

DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM

Add code
May 20, 2024
Viaarxiv icon

See Your Heart: Psychological states Interpretation through Visual Creations

Add code
Feb 11, 2023
Figure 1 for See Your Heart: Psychological states Interpretation through Visual Creations
Figure 2 for See Your Heart: Psychological states Interpretation through Visual Creations
Figure 3 for See Your Heart: Psychological states Interpretation through Visual Creations
Figure 4 for See Your Heart: Psychological states Interpretation through Visual Creations
Viaarxiv icon

A Split Semantic Detection Algorithm for Psychological Sandplay Image

Add code
Mar 02, 2022
Figure 1 for A Split Semantic Detection Algorithm for Psychological Sandplay Image
Figure 2 for A Split Semantic Detection Algorithm for Psychological Sandplay Image
Figure 3 for A Split Semantic Detection Algorithm for Psychological Sandplay Image
Figure 4 for A Split Semantic Detection Algorithm for Psychological Sandplay Image
Viaarxiv icon