Xi Li

Ego3DT: Tracking Every 3D Object in Ego-centric Videos

Oct 11, 2024

Smart Audit System Empowered by LLM

Oct 10, 2024

Associate Everything Detected: Facilitating Tracking-by-Detection to the Unknown

Sep 14, 2024

Improving Multimodal Emotion Recognition by Leveraging Acoustic Adaptation and Visual Alignment

Sep 10, 2024

Hybrid Mask Generation for Infrared Small Target Detection with Single-Point Supervision

Sep 06, 2024

CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities

Aug 23, 2024

Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning

Aug 22, 2024

TXL-PBC: a freely accessible labeled peripheral blood cell dataset

Jul 18, 2024

CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition

Jul 04, 2024

GVDIFF: Grounded Text-to-Video Generation with Diffusion Models

Jul 02, 2024