Picture for Lilin Wang

Lilin Wang

Toward Scalable Terminal Task Synthesis via Skill Graphs

Add code
Apr 28, 2026
Viaarxiv icon

HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds

Add code
Apr 15, 2026
Viaarxiv icon

SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

Add code
Dec 19, 2025
Figure 1 for SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
Figure 2 for SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
Figure 3 for SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
Figure 4 for SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories
Viaarxiv icon

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Add code
Oct 21, 2024
Figure 1 for Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
Figure 2 for Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
Figure 3 for Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
Figure 4 for Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection
Viaarxiv icon