Picture for Xi Li

Xi Li

Mark

TXL-PBC: a freely accessible labeled peripheral blood cell dataset

Add code
Jul 18, 2024
Figure 1 for TXL-PBC: a freely accessible labeled peripheral blood cell dataset
Figure 2 for TXL-PBC: a freely accessible labeled peripheral blood cell dataset
Figure 3 for TXL-PBC: a freely accessible labeled peripheral blood cell dataset
Figure 4 for TXL-PBC: a freely accessible labeled peripheral blood cell dataset
Viaarxiv icon

CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition

Add code
Jul 04, 2024
Figure 1 for CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition
Figure 2 for CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition
Figure 3 for CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition
Figure 4 for CLASH: Complementary Learning with Neural Architecture Search for Gait Recognition
Viaarxiv icon

GVDIFF: Grounded Text-to-Video Generation with Diffusion Models

Add code
Jul 02, 2024
Figure 1 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 2 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 3 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Figure 4 for GVDIFF: Grounded Text-to-Video Generation with Diffusion Models
Viaarxiv icon

Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model

Add code
Jun 28, 2024
Viaarxiv icon

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Add code
Jun 26, 2024
Viaarxiv icon

SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation

Add code
Jun 15, 2024
Viaarxiv icon

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

Add code
Jun 13, 2024
Figure 1 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 2 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 3 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 4 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Viaarxiv icon

Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

Add code
Jun 10, 2024
Figure 1 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 2 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 3 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 4 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Viaarxiv icon

CityCraft: A Real Crafter for 3D City Generation

Add code
Jun 07, 2024
Viaarxiv icon

CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation

Add code
May 17, 2024
Viaarxiv icon