Picture for Bing Li

Bing Li

Canon Medical Systems

SAM-Guided Masked Token Prediction for 3D Scene Understanding

Add code
Oct 17, 2024
Figure 1 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 2 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 3 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Figure 4 for SAM-Guided Masked Token Prediction for 3D Scene Understanding
Viaarxiv icon

SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data

Add code
Oct 13, 2024
Figure 1 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 2 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 3 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Figure 4 for SynFER: Towards Boosting Facial Expression Recognition with Synthetic Data
Viaarxiv icon

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Add code
Oct 02, 2024
Figure 1 for Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
Figure 2 for Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
Figure 3 for Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
Figure 4 for Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
Viaarxiv icon

Token Caching for Diffusion Transformer Acceleration

Add code
Sep 27, 2024
Figure 1 for Token Caching for Diffusion Transformer Acceleration
Figure 2 for Token Caching for Diffusion Transformer Acceleration
Figure 3 for Token Caching for Diffusion Transformer Acceleration
Figure 4 for Token Caching for Diffusion Transformer Acceleration
Viaarxiv icon

MobileIQA: Exploiting Mobile-level Diverse Opinion Network For No-Reference Image Quality Assessment Using Knowledge Distillation

Add code
Sep 02, 2024
Viaarxiv icon

TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks

Add code
Aug 20, 2024
Figure 1 for TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Figure 2 for TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Figure 3 for TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Figure 4 for TrackNeRF: Bundle Adjusting NeRF from Sparse and Noisy Views via Feature Tracks
Viaarxiv icon

SuperEncoder: Towards Universal Neural Approximate Quantum State Preparation

Add code
Aug 10, 2024
Viaarxiv icon

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Add code
Jul 21, 2024
Figure 1 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 2 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 3 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 4 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Viaarxiv icon

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Add code
Jul 10, 2024
Figure 1 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 2 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 3 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Figure 4 for How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
Viaarxiv icon

EA-VTR: Event-Aware Video-Text Retrieval

Add code
Jul 10, 2024
Figure 1 for EA-VTR: Event-Aware Video-Text Retrieval
Figure 2 for EA-VTR: Event-Aware Video-Text Retrieval
Figure 3 for EA-VTR: Event-Aware Video-Text Retrieval
Figure 4 for EA-VTR: Event-Aware Video-Text Retrieval
Viaarxiv icon