Picture for Yingli Shen

Yingli Shen

From Unaligned to Aligned: Scaling Multilingual LLMs with Multi-Way Parallel Corpora

Add code
May 20, 2025
Viaarxiv icon

GLTW: Joint Improved Graph Transformer and LLM via Three-Word Language for Knowledge Graph Completion

Add code
Feb 17, 2025
Viaarxiv icon

DCAD-2000: A Multilingual Dataset across 2000+ Languages with Data Cleaning as Anomaly Detection

Add code
Feb 17, 2025
Viaarxiv icon