Picture for Yao Li

Yao Li

Let's Think with Images Efficiently! An Interleaved-Modal Chain-of-Thought Reasoning Framework with Dynamic and Precise Visual Thoughts

Add code
Mar 23, 2026
Viaarxiv icon

An FPGA Implementation of Displacement Vector Search for Intra Pattern Copy in JPEG XS

Add code
Mar 11, 2026
Viaarxiv icon

FAVLA: A Force-Adaptive Fast-Slow VLA model for Contact-Rich Robotic Manipulation

Add code
Feb 27, 2026
Viaarxiv icon

Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis

Add code
Dec 21, 2025
Viaarxiv icon

PDAC: Efficient Coreset Selection for Continual Learning via Probability Density Awareness

Add code
Nov 12, 2025
Viaarxiv icon

Towards Frequency-Adaptive Learning for SAR Despeckling

Add code
Nov 08, 2025
Viaarxiv icon

SymCode: A Neurosymbolic Approach to Mathematical Reasoning via Verifiable Code Generation

Add code
Oct 29, 2025
Viaarxiv icon

In-Loop Filtering Using Learned Look-Up Tables for Video Coding

Add code
Sep 11, 2025
Viaarxiv icon

Single Index Bandits: Generalized Linear Contextual Bandits with Unknown Reward Functions

Add code
Jun 15, 2025
Viaarxiv icon

Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation

Add code
Apr 27, 2025
Figure 1 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Figure 2 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Figure 3 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Viaarxiv icon