Picture for Guangyao Zhai

Guangyao Zhai

VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs

Add code
Sep 30, 2024
Figure 1 for VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Figure 2 for VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Figure 3 for VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Figure 4 for VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
Viaarxiv icon

EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

Add code
May 02, 2024
Viaarxiv icon

GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering

Add code
Mar 17, 2024
Viaarxiv icon

SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation

Add code
Nov 18, 2023
Figure 1 for SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
Figure 2 for SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
Figure 3 for SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
Figure 4 for SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation
Viaarxiv icon

ShapeMaker: Self-Supervised Joint Shape Canonicalization, Segmentation, Retrieval and Deformation

Add code
Nov 18, 2023
Viaarxiv icon

SG-Bot: Object Rearrangement via Coarse-to-Fine Robotic Imagination on Scene Graphs

Add code
Sep 21, 2023
Viaarxiv icon

DDF-HO: Hand-Held Object Reconstruction via Conditional Directed Distance Field

Add code
Aug 16, 2023
Viaarxiv icon

CCD-3DR: Consistent Conditioning in Diffusion for Single-Image 3D Reconstruction

Add code
Aug 15, 2023
Viaarxiv icon

CommonScenes: Generating Commonsense 3D Indoor Scenes with Scene Graphs

Add code
May 25, 2023
Viaarxiv icon

On the Importance of Accurate Geometry Data for Dense 3D Vision Tasks

Add code
Mar 26, 2023
Viaarxiv icon