Picture for Da Yin

Da Yin

Violet

LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training

Add code
Oct 16, 2025
Viaarxiv icon

DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation

Add code
Oct 16, 2025
Viaarxiv icon

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Add code
Aug 08, 2025
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Viaarxiv icon

Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence

Add code
Jun 18, 2025
Viaarxiv icon

RefTool: Enhancing Model Reasoning with Reference-Guided Tool Creation

Add code
May 27, 2025
Viaarxiv icon

QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search

Add code
Feb 04, 2025
Figure 1 for QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Figure 2 for QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Figure 3 for QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Figure 4 for QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search
Viaarxiv icon

Bridging the Data Provenance Gap Across Text, Speech and Video

Add code
Dec 19, 2024
Figure 1 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 2 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 3 for Bridging the Data Provenance Gap Across Text, Speech and Video
Figure 4 for Bridging the Data Provenance Gap Across Text, Speech and Video
Viaarxiv icon

SafeWorld: Geo-Diverse Safety Alignment

Add code
Dec 09, 2024
Viaarxiv icon

VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning

Add code
Dec 03, 2024
Viaarxiv icon