Dongdong Zhang

AOI: Turning Failed Trajectories into Training Signals for Autonomous Cloud Diagnosis

Mar 05, 2026
Via arXiv

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

Jan 30, 2026
Via arXiv

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

Jan 20, 2026
Via arXiv

From Word to World: Can Large Language Models be Implicit Text-based World Models?

Dec 21, 2025
Via arXiv

VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models

Aug 13, 2025
Via arXiv

Scaling Laws of Synthetic Data for Language Models

Mar 26, 2025
Via arXiv

Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective

Jan 19, 2025
Via arXiv

ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework

Oct 25, 2024
Via arXiv

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

Feb 26, 2024
Via arXiv

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Feb 20, 2024
Via arXiv