Picture for Yun Li

Yun Li

CoViPAL: Layer-wise Contextualized Visual Token Pruning for Large Vision-Language Models

Add code
Aug 24, 2025
Viaarxiv icon

Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning

Add code
Aug 23, 2025
Figure 1 for Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning
Figure 2 for Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning
Figure 3 for Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning
Figure 4 for Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning
Viaarxiv icon

Automated CAD Modeling Sequence Generation from Text Descriptions via Transformer-Based Large Language Models

Add code
May 26, 2025
Viaarxiv icon

CellTypeAgent: Trustworthy cell type annotation with Large Language Models

Add code
May 13, 2025
Viaarxiv icon

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Add code
Mar 31, 2025
Viaarxiv icon

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Add code
Mar 18, 2025
Viaarxiv icon

Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection

Add code
Mar 12, 2025
Viaarxiv icon

Better Process Supervision with Bi-directional Rewarding Signals

Add code
Mar 06, 2025
Viaarxiv icon

Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems

Add code
Feb 21, 2025
Figure 1 for Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems
Figure 2 for Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems
Figure 3 for Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems
Figure 4 for Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems
Viaarxiv icon

Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning

Add code
Feb 19, 2025
Figure 1 for Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning
Figure 2 for Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning
Figure 3 for Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning
Figure 4 for Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning
Viaarxiv icon