Picture for Bohan Zeng

Bohan Zeng

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification

Add code
Jun 08, 2025
Viaarxiv icon

MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

Add code
May 27, 2025
Viaarxiv icon

Let's Verify Math Questions Step by Step

Add code
May 20, 2025
Viaarxiv icon

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Add code
Apr 14, 2025
Viaarxiv icon

WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes

Add code
Mar 17, 2025
Viaarxiv icon

Any2AnyTryon: Leveraging Adaptive Position Embeddings for Versatile Virtual Clothing Tasks

Add code
Jan 27, 2025
Viaarxiv icon

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Add code
Oct 11, 2024
Figure 1 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 2 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 3 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Figure 4 for Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Viaarxiv icon

Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis

Add code
Oct 09, 2024
Viaarxiv icon

EditWorld: Simulating World Dynamics for Instruction-Following Image Editing

Add code
May 23, 2024
Figure 1 for EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Figure 2 for EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Figure 3 for EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Figure 4 for EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Viaarxiv icon

ZONE: Zero-Shot Instruction-Guided Local Editing

Add code
Dec 28, 2023
Figure 1 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 2 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 3 for ZONE: Zero-Shot Instruction-Guided Local Editing
Figure 4 for ZONE: Zero-Shot Instruction-Guided Local Editing
Viaarxiv icon