Picture for Yuke Li

Yuke Li

Joint Design of Doppler-Resilient Unimodular Discrete-Phase Waveforms and Receiving Filters for MIMO Radars

Add code
Apr 28, 2026
Viaarxiv icon

DiffVC: A Non-autoregressive Framework Based on Diffusion Model for Video Captioning

Add code
Apr 09, 2026
Viaarxiv icon

EPIR: An Efficient Patch Tokenization, Integration and Representation Framework for Micro-expression Recognition

Add code
Apr 09, 2026
Viaarxiv icon

SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition

Add code
Mar 18, 2026
Viaarxiv icon

SELECT: Detecting Label Errors in Real-world Scene Text Data

Add code
Dec 16, 2025
Figure 1 for SELECT: Detecting Label Errors in Real-world Scene Text Data
Figure 2 for SELECT: Detecting Label Errors in Real-world Scene Text Data
Figure 3 for SELECT: Detecting Label Errors in Real-world Scene Text Data
Figure 4 for SELECT: Detecting Label Errors in Real-world Scene Text Data
Viaarxiv icon

XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation

Add code
Aug 12, 2025
Figure 1 for XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
Figure 2 for XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
Figure 3 for XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
Figure 4 for XEmoRAG: Cross-Lingual Emotion Transfer with Controllable Intensity Using Retrieval-Augmented Generation
Viaarxiv icon

CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians

Add code
May 28, 2025
Figure 1 for CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians
Figure 2 for CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians
Figure 3 for CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians
Figure 4 for CityGo: Lightweight Urban Modeling and Rendering with Proxy Buildings and Residual Gaussians
Viaarxiv icon

Riemannian Complex Hermit Positive Definite Convolution Network for Polarimetric SAR Image Classification

Add code
Feb 12, 2025
Viaarxiv icon

FleSpeech: Flexibly Controllable Speech Generation with Various Prompts

Add code
Jan 08, 2025
Figure 1 for FleSpeech: Flexibly Controllable Speech Generation with Various Prompts
Figure 2 for FleSpeech: Flexibly Controllable Speech Generation with Various Prompts
Figure 3 for FleSpeech: Flexibly Controllable Speech Generation with Various Prompts
Figure 4 for FleSpeech: Flexibly Controllable Speech Generation with Various Prompts
Viaarxiv icon

CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion

Add code
Dec 03, 2024
Figure 1 for CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion
Figure 2 for CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion
Figure 3 for CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion
Figure 4 for CoDiff-VC: A Codec-Assisted Diffusion Model for Zero-shot Voice Conversion
Viaarxiv icon