Tong He

Depth Any Video with Scalable Synthetic Data

Oct 14, 2024
Via arXiv

VideoSAM: Open-World Video Segmentation

Oct 11, 2024
Via arXiv

SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

Oct 10, 2024
Via arXiv

StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting

Oct 06, 2024
Via arXiv

One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos

Sep 29, 2024
Via arXiv

GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction

Sep 10, 2024
Via arXiv

Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation

Sep 07, 2024
Via arXiv

DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting

Aug 26, 2024
Via arXiv

ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction

Aug 22, 2024
Via arXiv

NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction

Aug 19, 2024
Via arXiv