Picture for Hongxing Li

Hongxing Li

Milestone-Guided Policy Learning for Long-Horizon Language Agents

Add code
May 07, 2026
Viaarxiv icon

SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments

Add code
Apr 15, 2026
Viaarxiv icon

Seeing but Not Thinking: Routing Distraction in Multimodal Mixture-of-Experts

Add code
Apr 09, 2026
Viaarxiv icon

OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks

Add code
Aug 07, 2025
Viaarxiv icon

TCSAFormer: Efficient Vision Transformer with Token Compression and Sparse Attention for Medical Image Segmentation

Add code
Aug 06, 2025
Viaarxiv icon

MedFormer: Hierarchical Medical Vision Transformer with Content-Aware Dual Sparse Selection Attention

Add code
Jul 03, 2025
Viaarxiv icon

Cross-Modal Clustering-Guided Negative Sampling for Self-Supervised Joint Learning from Medical Images and Reports

Add code
Jun 13, 2025
Viaarxiv icon

DMAF-Net: An Effective Modality Rebalancing Framework for Incomplete Multi-Modal Medical Image Segmentation

Add code
Jun 13, 2025
Viaarxiv icon

ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models

Add code
May 27, 2025
Figure 1 for ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
Figure 2 for ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
Figure 3 for ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
Figure 4 for ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models
Viaarxiv icon