Picture for Xiang Bai

Xiang Bai

Huazhong University of Science and Technology

Attention-Guided Perturbation for Unsupervised Image Anomaly Detection

Add code
Aug 14, 2024
Viaarxiv icon

Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models

Add code
Aug 09, 2024
Figure 1 for Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models
Figure 2 for Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models
Figure 3 for Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models
Figure 4 for Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models
Viaarxiv icon

Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping

Add code
Aug 04, 2024
Figure 1 for Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping
Figure 2 for Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping
Figure 3 for Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping
Figure 4 for Mini-Monkey: Alleviate the Sawtooth Effect by Multi-Scale Adaptive Cropping
Viaarxiv icon

WAS: Dataset and Methods for Artistic Text Segmentation

Add code
Jul 31, 2024
Figure 1 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 2 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 3 for WAS: Dataset and Methods for Artistic Text Segmentation
Figure 4 for WAS: Dataset and Methods for Artistic Text Segmentation
Viaarxiv icon

LION: Linear Group RNN for 3D Object Detection in Point Clouds

Add code
Jul 25, 2024
Figure 1 for LION: Linear Group RNN for 3D Object Detection in Point Clouds
Figure 2 for LION: Linear Group RNN for 3D Object Detection in Point Clouds
Figure 3 for LION: Linear Group RNN for 3D Object Detection in Point Clouds
Figure 4 for LION: Linear Group RNN for 3D Object Detection in Point Clouds
Viaarxiv icon

PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects

Add code
Jul 23, 2024
Viaarxiv icon

SEED: A Simple and Effective 3D DETR in Point Clouds

Add code
Jul 15, 2024
Figure 1 for SEED: A Simple and Effective 3D DETR in Point Clouds
Figure 2 for SEED: A Simple and Effective 3D DETR in Point Clouds
Figure 3 for SEED: A Simple and Effective 3D DETR in Point Clouds
Figure 4 for SEED: A Simple and Effective 3D DETR in Point Clouds
Viaarxiv icon

OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

Add code
Jul 15, 2024
Figure 1 for OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
Figure 2 for OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
Figure 3 for OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
Figure 4 for OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection
Viaarxiv icon

A Unified Framework for 3D Scene Understanding

Add code
Jul 03, 2024
Figure 1 for A Unified Framework for 3D Scene Understanding
Figure 2 for A Unified Framework for 3D Scene Understanding
Figure 3 for A Unified Framework for 3D Scene Understanding
Figure 4 for A Unified Framework for 3D Scene Understanding
Viaarxiv icon

SOOD++: Leveraging Unlabeled Data to Boost Oriented Object Detection

Add code
Jul 01, 2024
Viaarxiv icon