Picture for Xi Li

Xi Li

Mark

Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model

Add code
Jun 28, 2024
Figure 1 for Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Figure 2 for Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Figure 3 for Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Figure 4 for Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
Viaarxiv icon

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

Add code
Jun 26, 2024
Figure 1 for ScanFormer: Referring Expression Comprehension by Iteratively Scanning
Figure 2 for ScanFormer: Referring Expression Comprehension by Iteratively Scanning
Figure 3 for ScanFormer: Referring Expression Comprehension by Iteratively Scanning
Figure 4 for ScanFormer: Referring Expression Comprehension by Iteratively Scanning
Viaarxiv icon

SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation

Add code
Jun 15, 2024
Figure 1 for SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation
Figure 2 for SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation
Figure 3 for SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation
Figure 4 for SemanticMIM: Marring Masked Image Modeling with Semantics Compression for General Visual Representation
Viaarxiv icon

BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

Add code
Jun 13, 2024
Figure 1 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 2 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 3 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Figure 4 for BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection
Viaarxiv icon

Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models

Add code
Jun 10, 2024
Figure 1 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 2 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 3 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Figure 4 for Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
Viaarxiv icon

CityCraft: A Real Crafter for 3D City Generation

Add code
Jun 07, 2024
Figure 1 for CityCraft: A Real Crafter for 3D City Generation
Figure 2 for CityCraft: A Real Crafter for 3D City Generation
Figure 3 for CityCraft: A Real Crafter for 3D City Generation
Figure 4 for CityCraft: A Real Crafter for 3D City Generation
Viaarxiv icon

CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation

Add code
May 17, 2024
Figure 1 for CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation
Figure 2 for CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation
Figure 3 for CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation
Figure 4 for CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation
Viaarxiv icon

MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

Add code
Apr 26, 2024
Viaarxiv icon

Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues

Add code
Apr 12, 2024
Figure 1 for Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues
Figure 2 for Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues
Figure 3 for Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues
Figure 4 for Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues
Viaarxiv icon

SphereDiffusion: Spherical Geometry-Aware Distortion Resilient Diffusion Model

Add code
Mar 15, 2024
Viaarxiv icon