Picture for Borui Jiang

Borui Jiang

Towards Lossless Ultimate Vision Token Compression for VLMs

Add code
Dec 09, 2025
Viaarxiv icon

Positional Preservation Embedding for Multimodal Large Language Models

Add code
Oct 27, 2025
Viaarxiv icon

OmniEval: A Benchmark for Evaluating Omni-modal Models with Visual, Auditory, and Textual Inputs

Add code
Jun 26, 2025
Viaarxiv icon

Single Domain Generalization for Few-Shot Counting via Universal Representation Matching

Add code
May 22, 2025
Figure 1 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 2 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 3 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Figure 4 for Single Domain Generalization for Few-Shot Counting via Universal Representation Matching
Viaarxiv icon

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Add code
May 08, 2025
Figure 1 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 2 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 3 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Figure 4 for Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models
Viaarxiv icon

Deep High-Resolution Representation Learning for Visual Recognition

Add code
Aug 20, 2019
Figure 1 for Deep High-Resolution Representation Learning for Visual Recognition
Figure 2 for Deep High-Resolution Representation Learning for Visual Recognition
Figure 3 for Deep High-Resolution Representation Learning for Visual Recognition
Figure 4 for Deep High-Resolution Representation Learning for Visual Recognition
Viaarxiv icon

High-Resolution Representations for Labeling Pixels and Regions

Add code
Apr 09, 2019
Figure 1 for High-Resolution Representations for Labeling Pixels and Regions
Figure 2 for High-Resolution Representations for Labeling Pixels and Regions
Figure 3 for High-Resolution Representations for Labeling Pixels and Regions
Figure 4 for High-Resolution Representations for Labeling Pixels and Regions
Viaarxiv icon

Acquisition of Localization Confidence for Accurate Object Detection

Add code
Jul 30, 2018
Figure 1 for Acquisition of Localization Confidence for Accurate Object Detection
Figure 2 for Acquisition of Localization Confidence for Accurate Object Detection
Figure 3 for Acquisition of Localization Confidence for Accurate Object Detection
Figure 4 for Acquisition of Localization Confidence for Accurate Object Detection
Viaarxiv icon