Picture for Sifei Liu

Sifei Liu

SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation

Add code
Sep 20, 2024
Figure 1 for SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation
Figure 2 for SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation
Figure 3 for SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation
Figure 4 for SSE: Multimodal Semantic Data Selection and Enrichment for Industrial-scale Data Assimilation
Viaarxiv icon

GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation

Add code
Jun 18, 2024
Figure 1 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 2 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 3 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Figure 4 for GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation
Viaarxiv icon

CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation

Add code
Jun 04, 2024
Figure 1 for CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Figure 2 for CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Figure 3 for CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Figure 4 for CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation
Viaarxiv icon

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Model

Add code
Jun 03, 2024
Viaarxiv icon

Compositional Text-to-Image Generation with Dense Blob Representations

Add code
May 14, 2024
Viaarxiv icon

HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

Add code
Mar 18, 2024
Viaarxiv icon

RegionGPT: Towards Region Understanding Vision Language Model

Add code
Mar 04, 2024
Figure 1 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 2 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 3 for RegionGPT: Towards Region Understanding Vision Language Model
Figure 4 for RegionGPT: Towards Region Understanding Vision Language Model
Viaarxiv icon

RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos

Add code
Jan 24, 2024
Figure 1 for RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
Figure 2 for RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
Figure 3 for RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
Figure 4 for RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos
Viaarxiv icon

AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Add code
Jan 08, 2024
Viaarxiv icon

COLMAP-Free 3D Gaussian Splatting

Add code
Dec 12, 2023
Figure 1 for COLMAP-Free 3D Gaussian Splatting
Figure 2 for COLMAP-Free 3D Gaussian Splatting
Figure 3 for COLMAP-Free 3D Gaussian Splatting
Figure 4 for COLMAP-Free 3D Gaussian Splatting
Viaarxiv icon