Picture for Yongxiang Li

Yongxiang Li

Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis

Add code
May 27, 2025
Viaarxiv icon

Chain-of-Lure: A Synthetic Narrative-Driven Approach to Compromise Large Language Models

Add code
May 23, 2025
Viaarxiv icon

Table-R1: Region-based Reinforcement Learning for Table Understanding

Add code
May 18, 2025
Viaarxiv icon

Robust Duality Learning for Unsupervised Visible-Infrared Person Re-Identfication

Add code
May 05, 2025
Viaarxiv icon

GOAT-TTS: LLM-based Text-To-Speech Generation Optimized via A Dual-Branch Architecture

Add code
Apr 15, 2025
Viaarxiv icon

Locate-then-edit for Multi-hop Factual Recall under Knowledge Editing

Add code
Oct 08, 2024
Viaarxiv icon

Disentangle and denoise: Tackling context misalignment for video moment retrieval

Add code
Aug 14, 2024
Viaarxiv icon

52B to 1T: Lessons Learned via Tele-FLM Series

Add code
Jul 03, 2024
Figure 1 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 2 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 3 for 52B to 1T: Lessons Learned via Tele-FLM Series
Figure 4 for 52B to 1T: Lessons Learned via Tele-FLM Series
Viaarxiv icon

Tele-FLM Technical Report

Add code
Apr 25, 2024
Figure 1 for Tele-FLM Technical Report
Figure 2 for Tele-FLM Technical Report
Figure 3 for Tele-FLM Technical Report
Figure 4 for Tele-FLM Technical Report
Viaarxiv icon

ProTA: Probabilistic Token Aggregation for Text-Video Retrieval

Add code
Apr 18, 2024
Figure 1 for ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
Figure 2 for ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
Figure 3 for ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
Figure 4 for ProTA: Probabilistic Token Aggregation for Text-Video Retrieval
Viaarxiv icon