Picture for Bohao Peng

Bohao Peng

Training-Free Efficient Video Generation via Dynamic Token Carving

Add code
May 22, 2025
Viaarxiv icon

VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning

Add code
May 17, 2025
Viaarxiv icon

Does Your Vision-Language Model Get Lost in the Long Video Sampling Dilemma?

Add code
Mar 16, 2025
Viaarxiv icon

Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement

Add code
Mar 09, 2025
Viaarxiv icon

Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers

Add code
Jan 07, 2025
Figure 1 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 2 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 3 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Figure 4 for Magic Mirror: ID-Preserved Video Generation in Video Diffusion Transformers
Viaarxiv icon

ControlNeXt: Powerful and Efficient Control for Image and Video Generation

Add code
Aug 15, 2024
Figure 1 for ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Figure 2 for ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Figure 3 for ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Figure 4 for ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Viaarxiv icon

Scalable Language Model with Generalized Continual Learning

Add code
Apr 11, 2024
Figure 1 for Scalable Language Model with Generalized Continual Learning
Figure 2 for Scalable Language Model with Generalized Continual Learning
Figure 3 for Scalable Language Model with Generalized Continual Learning
Figure 4 for Scalable Language Model with Generalized Continual Learning
Viaarxiv icon

OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation

Add code
Mar 21, 2024
Viaarxiv icon

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Add code
Mar 14, 2024
Figure 1 for GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Figure 2 for GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Figure 3 for GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Figure 4 for GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding
Viaarxiv icon

An Improved Baseline for Reasoning Segmentation with Large Language Model

Add code
Jan 03, 2024
Figure 1 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 2 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 3 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Figure 4 for An Improved Baseline for Reasoning Segmentation with Large Language Model
Viaarxiv icon