Shansan Gong

DreamOn: Diffusion Language Models For Code Infilling Beyond Fixed-size Canvas

Feb 01, 2026
Via arXiv

OVD: On-policy Verbal Distillation

Jan 29, 2026
Via arXiv

Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone

Dec 27, 2025
Via arXiv

EconProver: Towards More Economical Test-Time Scaling for Automated Theorem Proving

Sep 16, 2025
Via arXiv

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Jun 26, 2025
Via arXiv

GIRAFFE: Design Choices for Extending the Context Length of Visual Language Models

Dec 17, 2024
Via arXiv

Why Does the Effective Context Length of LLMs Fall Short?

Oct 24, 2024
Via arXiv

Scaling Diffusion Language Models via Adaptation from Autoregressive Models

Oct 23, 2024
Via arXiv

Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning

Oct 18, 2024
Via arXiv

Training-Free Long-Context Scaling of Large Language Models

Feb 27, 2024
Via arXiv