Picture for Renrui Zhang

Renrui Zhang

Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO

Add code
May 22, 2025
Viaarxiv icon

UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens

Add code
May 20, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT

Add code
May 01, 2025
Viaarxiv icon

TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

Add code
Apr 22, 2025
Figure 1 for TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Figure 2 for TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Figure 3 for TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Figure 4 for TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving
Viaarxiv icon

From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Add code
Apr 22, 2025
Viaarxiv icon

Detect Anything 3D in the Wild

Add code
Apr 10, 2025
Viaarxiv icon

Concept-as-Tree: Synthetic Data is All You Need for VLM Personalization

Add code
Mar 17, 2025
Viaarxiv icon

3DAxisPrompt: Promoting the 3D Grounding and Reasoning in GPT-4o

Add code
Mar 17, 2025
Viaarxiv icon

PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models

Add code
Mar 13, 2025
Figure 1 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 2 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 3 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Figure 4 for PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Viaarxiv icon