Yun Li

CoViPAL: Layer-wise Contextualized Visual Token Pruning for Large Vision-Language Models

Aug 24, 2025
Via arXiv

Multimodal Medical Endoscopic Image Analysis via Progressive Disentangle-aware Contrastive Learning

Aug 23, 2025
Via arXiv

Automated CAD Modeling Sequence Generation from Text Descriptions via Transformer-Based Large Language Models

May 26, 2025
Via arXiv

CellTypeAgent: Trustworthy cell type annotation with Large Language Models

May 13, 2025
Via arXiv

STI-Bench: Are MLLMs Ready for Precise Spatial-Temporal World Understanding?

Mar 31, 2025
Via arXiv

MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding

Mar 18, 2025
Via arXiv

Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection

Mar 12, 2025
Via arXiv

Better Process Supervision with Bi-directional Rewarding Signals

Mar 06, 2025
Via arXiv

Investigating the Adaptive Robustness with Knowledge Conflicts in LLM-based Multi-Agent Systems

Feb 21, 2025
Via arXiv

Sce2DriveX: A Generalized MLLM Framework for Scene-to-Drive Learning

Feb 19, 2025
Via arXiv