Hongyi Cai

VisNec: Measuring and Leveraging Visual Necessity for Multimodal Instruction Tuning

Mar 01, 2026

When Vision Meets Texts in Listwise Reranking

Jan 28, 2026

Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment

Nov 06, 2025

Enhancing Large Language Models' Situated Faithfulness to External Contexts

Oct 18, 2024

AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration

Sep 10, 2024

CFPFormer: Feature-pyramid like Transformer Decoder for Segmentation and Detection

Apr 23, 2024

AccidentBlip2: Accident Detection With Multi-View MotionBlip2

Apr 19, 2024