Picture for Mingxuan Li

Mingxuan Li

From Pixels to Words -- Towards Native One-Vision Models at Scale

Add code
May 27, 2026
Viaarxiv icon

AnchorSeg: Language Grounded Query Banks for Reasoning Segmentation

Add code
Apr 22, 2026
Viaarxiv icon

NeuroLoRA: Context-Aware Neuromodulation for Parameter-Efficient Multi-Task Adaptation

Add code
Mar 12, 2026
Viaarxiv icon

AmbiBench: Benchmarking Mobile GUI Agents Beyond One-Shot Instructions in the Wild

Add code
Feb 12, 2026
Viaarxiv icon

Confounding Robust Continuous Control via Automatic Reward Shaping

Add code
Feb 10, 2026
Viaarxiv icon

DR.Experts: Differential Refinement of Distortion-Aware Experts for Blind Image Quality Assessment

Add code
Feb 10, 2026
Viaarxiv icon

Causal Flow Q-Learning for Robust Offline Reinforcement Learning

Add code
Feb 02, 2026
Viaarxiv icon

IFG: Internet-Scale Guidance for Functional Grasping Generation

Add code
Nov 12, 2025
Figure 1 for IFG: Internet-Scale Guidance for Functional Grasping Generation
Figure 2 for IFG: Internet-Scale Guidance for Functional Grasping Generation
Figure 3 for IFG: Internet-Scale Guidance for Functional Grasping Generation
Figure 4 for IFG: Internet-Scale Guidance for Functional Grasping Generation
Viaarxiv icon

Solvaformer: an SE(3)-equivariant graph transformer for small molecule solubility prediction

Add code
Nov 12, 2025
Figure 1 for Solvaformer: an SE(3)-equivariant graph transformer for small molecule solubility prediction
Figure 2 for Solvaformer: an SE(3)-equivariant graph transformer for small molecule solubility prediction
Figure 3 for Solvaformer: an SE(3)-equivariant graph transformer for small molecule solubility prediction
Figure 4 for Solvaformer: an SE(3)-equivariant graph transformer for small molecule solubility prediction
Viaarxiv icon

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Add code
Oct 16, 2025
Viaarxiv icon