Picture for Lichao Sun

Lichao Sun

Lehigh University

BLURR: A Boosted Low-Resource Inference for Vision-Language-Action Models

Add code
Dec 12, 2025
Viaarxiv icon

3D4D: An Interactive, Editable, 4D World Model via 3D Video Generation

Add code
Nov 11, 2025
Viaarxiv icon

A Survey of AI Scientists: Surveying the automatic Scientists and Research

Add code
Oct 27, 2025
Viaarxiv icon

RadFabric: Agentic AI System with Reasoning Capability for Radiology

Add code
Jun 17, 2025
Viaarxiv icon

Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning

Add code
Jun 11, 2025
Viaarxiv icon

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents

Add code
May 29, 2025
Viaarxiv icon

Advancing Multimodal Reasoning via Reinforcement Learning with Cold Start

Add code
May 28, 2025
Viaarxiv icon

Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO

Add code
May 28, 2025
Viaarxiv icon

Decision Flow Policy Optimization

Add code
May 26, 2025
Viaarxiv icon

Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning

Add code
May 23, 2025
Viaarxiv icon