Picture for Liang Wang

Liang Wang

Institute of Automation, CAS

DiffSpectra: Molecular Structure Elucidation from Spectra using Diffusion Models

Add code
Jul 09, 2025
Viaarxiv icon

EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow

Add code
Jul 08, 2025
Figure 1 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 2 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 3 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Figure 4 for EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow
Viaarxiv icon

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

Add code
Jun 11, 2025
Figure 1 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 2 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 3 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Figure 4 for Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
Viaarxiv icon

VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks

Add code
Jun 10, 2025
Figure 1 for VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks
Figure 2 for VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks
Figure 3 for VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks
Figure 4 for VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks
Viaarxiv icon

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG

Add code
May 27, 2025
Figure 1 for Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Figure 2 for Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Figure 3 for Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Figure 4 for Divide-Then-Align: Honest Alignment based on the Knowledge Boundary of RAG
Viaarxiv icon

Reinforcing General Reasoning without Verifiers

Add code
May 27, 2025
Viaarxiv icon

REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing

Add code
May 25, 2025
Figure 1 for REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
Figure 2 for REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
Figure 3 for REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
Figure 4 for REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing
Viaarxiv icon

AuroRA: Breaking Low-Rank Bottleneck of LoRA with Nonlinear Mapping

Add code
May 24, 2025
Viaarxiv icon

Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey

Add code
May 22, 2025
Figure 1 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 2 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 3 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Figure 4 for Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey
Viaarxiv icon