Picture for Hongxing Li

Hongxing Li

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Add code
Aug 07, 2025
Viaarxiv icon

TCSAFormer: Efficient Vision Transformer with Token Compression and Sparse Attention for Medical Image Segmentation

Add code
Aug 06, 2025
Viaarxiv icon

MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention

Add code
Jul 03, 2025
Viaarxiv icon

Cross-Modal Clustering-Guided Negative Sampling for Self-Supervised Joint Learning from Medical Images and Reports

Add code
Jun 13, 2025
Viaarxiv icon

DMAF-Net: An Effective Modality Rebalancing Framework for Incomplete Multi-Modal Medical Image Segmentation

Add code
Jun 13, 2025
Viaarxiv icon

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Add code
May 27, 2025
Viaarxiv icon