Liang Wang

Institute of Automation, CAS

GenPilot: A Multi-Agent System for Test-Time Prompt Optimization in Image Generation

Oct 08, 2025
Via arXiv

Variational Reasoning for Language Models

Sep 26, 2025
Via arXiv

EgoDemoGen: Novel Egocentric Demonstration Generation Enables Viewpoint-Robust Manipulation

Sep 26, 2025
Via arXiv

BaseReward: A Strong Baseline for Multimodal Reward Model

Sep 19, 2025
Via arXiv

Solving the Min-Max Multiple Traveling Salesmen Problem via Learning-Based Path Generation and Optimal Splitting

Aug 23, 2025
Via arXiv

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation

Aug 21, 2025
Via arXiv

Foundation Model for Skeleton-Based Human Action Understanding

Aug 18, 2025
Via arXiv

DTPA: Dynamic Token-level Prefix Augmentation for Controllable Text Generation

Aug 06, 2025
Via arXiv

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

Aug 06, 2025
Via arXiv

Player-Centric Multimodal Prompt Generation for Large Language Model Based Identity-Aware Basketball Video Captioning

Jul 27, 2025
Via arXiv