Picture for Ximeng Sun

Ximeng Sun

CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models

Add code
Jan 05, 2026
Viaarxiv icon

Instella: Fully Open Language Models with Stellar Performance

Add code
Nov 14, 2025
Viaarxiv icon

Learning from Online Videos at Inference Time for Computer-Use Agents

Add code
Nov 06, 2025
Figure 1 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 2 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 3 for Learning from Online Videos at Inference Time for Computer-Use Agents
Figure 4 for Learning from Online Videos at Inference Time for Computer-Use Agents
Viaarxiv icon

Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation

Add code
Jun 26, 2025
Viaarxiv icon

Unleashing Hour-Scale Video Training for Long Video-Language Understanding

Add code
Jun 05, 2025
Viaarxiv icon

MOVi: Training-free Text-conditioned Multi-Object Video Generation

Add code
May 29, 2025
Viaarxiv icon

KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation

Add code
Apr 13, 2025
Viaarxiv icon

Self-Taught Agentic Long Context Understanding

Add code
Feb 21, 2025
Viaarxiv icon

Agent Laboratory: Using LLM Agents as Research Assistants

Add code
Jan 08, 2025
Figure 1 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 2 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 3 for Agent Laboratory: Using LLM Agents as Research Assistants
Figure 4 for Agent Laboratory: Using LLM Agents as Research Assistants
Viaarxiv icon

SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer

Add code
Dec 14, 2024
Figure 1 for SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Figure 2 for SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Figure 3 for SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Figure 4 for SoftVQ-VAE: Efficient 1-Dimensional Continuous Tokenizer
Viaarxiv icon