Picture for Xiaomeng Yang

Xiaomeng Yang

Towards Autonomous Mathematics Research

Add code
Feb 12, 2026
Viaarxiv icon

Omni-Video 2: Scaling MLLM-Conditioned Diffusion for Unified Video Generation and Editing

Add code
Feb 09, 2026
Viaarxiv icon

A unified multimodal understanding and generation model for cross-disciplinary scientific research

Add code
Jan 04, 2026
Viaarxiv icon

ZeroSim: Zero-Shot Analog Circuit Evaluation with Unified Transformer Embeddings

Add code
Nov 10, 2025
Viaarxiv icon

Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision

Add code
Aug 07, 2025
Figure 1 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 2 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 3 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Figure 4 for Uni-cot: Towards Unified Chain-of-Thought Reasoning Across Text and Vision
Viaarxiv icon

SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training

Add code
May 28, 2025
Figure 1 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 2 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 3 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 4 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Viaarxiv icon

ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation

Add code
May 27, 2025
Viaarxiv icon

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Add code
Apr 30, 2025
Figure 1 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 2 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 3 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Figure 4 for Visual Text Processing: A Comprehensive Review and Unified Evaluation
Viaarxiv icon

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

Add code
Mar 24, 2025
Viaarxiv icon

Cockatiel: Ensembling Synthetic and Human Preferenced Training for Detailed Video Caption

Add code
Mar 12, 2025
Viaarxiv icon