Picture for Jianan Wang

Jianan Wang

MiniMax-Remover: Taming Bad Noise Helps Video Object Removal

Add code
May 30, 2025
Viaarxiv icon

Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model

Add code
Feb 24, 2025
Viaarxiv icon

OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation

Add code
Dec 15, 2024
Figure 1 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 2 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 3 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Figure 4 for OccScene: Semantic Occupancy-based Cross-task Mutual Learning for 3D Scene Generation
Viaarxiv icon

DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion

Add code
Sep 25, 2024
Figure 1 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 2 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 3 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Figure 4 for DreamWaltz-G: Expressive 3D Gaussian Avatars from Skeleton-Guided 2D Diffusion
Viaarxiv icon

BeSimulator: A Large Language Model Powered Text-based Behavior Simulator

Add code
Sep 24, 2024
Figure 1 for BeSimulator: A Large Language Model Powered Text-based Behavior Simulator
Figure 2 for BeSimulator: A Large Language Model Powered Text-based Behavior Simulator
Figure 3 for BeSimulator: A Large Language Model Powered Text-based Behavior Simulator
Figure 4 for BeSimulator: A Large Language Model Powered Text-based Behavior Simulator
Viaarxiv icon

Technique Report of CVPR 2024 PBDL Challenges

Add code
Jun 15, 2024
Figure 1 for Technique Report of CVPR 2024 PBDL Challenges
Figure 2 for Technique Report of CVPR 2024 PBDL Challenges
Figure 3 for Technique Report of CVPR 2024 PBDL Challenges
Figure 4 for Technique Report of CVPR 2024 PBDL Challenges
Viaarxiv icon

HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

Add code
Mar 28, 2024
Figure 1 for HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
Figure 2 for HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
Figure 3 for HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
Figure 4 for HGT: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding
Viaarxiv icon

CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, Controllability and Compatibility

Add code
Mar 18, 2024
Viaarxiv icon

Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription

Add code
Mar 16, 2024
Figure 1 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 2 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 3 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Figure 4 for Ctrl123: Consistent Novel View Synthesis via Closed-Loop Transcription
Viaarxiv icon

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

Add code
Jan 09, 2024
Viaarxiv icon