Shi-Min Hu

Beyond Inpainting: Unleash 3D Understanding for Precise Camera-Controlled Video Generation

Jan 15, 2026

DEER: Draft with Diffusion, Verify with Autoregressive Models

Dec 17, 2025

NeuralSSD: A Neural Solver for Signed Distance Surface Reconstruction

Nov 18, 2025

GPR: Towards a Generative Pre-trained One-Model Paradigm for Large-Scale Advertising Recommendation

Nov 13, 2025

Recovering Complete Actions for Cross-dataset Skeleton Action Recognition

Oct 31, 2024

CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization

Feb 28, 2024

Semantic-Aware Transformation-Invariant RoI Align

Dec 15, 2023

DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion

May 04, 2023

Long Range Pooling for 3D Large-Scale Scene Understanding

Jan 17, 2023

SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation

Sep 18, 2022