Picture for Weijian Luo

Weijian Luo

Multimodal OCR: Parse Anything from Documents

Add code
Mar 13, 2026
Viaarxiv icon

TDM-R1: Reinforcing Few-Step Diffusion Models with Non-Differentiable Reward

Add code
Mar 08, 2026
Viaarxiv icon

ZeroDiff++: Substantial Unseen Visual-semantic Correlation in Zero-shot Learning

Add code
Feb 12, 2026
Viaarxiv icon

Ultra Fast PDE Solving via Physics Guided Few-step Diffusion

Add code
Feb 03, 2026
Viaarxiv icon

Masked Auto-Regressive Variational Acceleration: Fast Inference Makes Practical Reinforcement Learning

Add code
Nov 19, 2025
Viaarxiv icon

Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation

Add code
Nov 18, 2025
Figure 1 for Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation
Figure 2 for Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation
Figure 3 for Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation
Figure 4 for Let Language Constrain Geometry: Vision-Language Models as Semantic and Spatial Critics for 3D Generation
Viaarxiv icon

Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation

Add code
Sep 19, 2025
Figure 1 for Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation
Figure 2 for Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation
Figure 3 for Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation
Figure 4 for Vision-Language Models as Differentiable Semantic and Spatial Rewards for Text-to-3D Generation
Viaarxiv icon

Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching

Add code
Jun 16, 2025
Viaarxiv icon

Uni-Instruct: One-step Diffusion Model through Unified Diffusion Divergence Instruction

Add code
May 27, 2025
Viaarxiv icon

Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation

Add code
Mar 17, 2025
Viaarxiv icon