Picture for Jonathan Huang

Jonathan Huang

Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors

Add code
Jul 14, 2024
Figure 1 for Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Figure 2 for Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Figure 3 for Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Figure 4 for Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors
Viaarxiv icon

Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations

Add code
Jun 17, 2024
Figure 1 for Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations
Figure 2 for Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations
Figure 3 for Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations
Figure 4 for Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations
Viaarxiv icon

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Add code
Dec 21, 2023
Figure 1 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 2 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 3 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Figure 4 for VideoPoet: A Large Language Model for Zero-Shot Video Generation
Viaarxiv icon

Text and Click inputs for unambiguous open vocabulary instance segmentation

Add code
Nov 24, 2023
Figure 1 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 2 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 3 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Figure 4 for Text and Click inputs for unambiguous open vocabulary instance segmentation
Viaarxiv icon

Optimizing ViViT Training: Time and Memory Reduction for Action Recognition

Add code
Jun 07, 2023
Figure 1 for Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Figure 2 for Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Figure 3 for Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Figure 4 for Optimizing ViViT Training: Time and Memory Reduction for Action Recognition
Viaarxiv icon

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

Add code
Jun 02, 2023
Figure 1 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 2 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 3 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Figure 4 for DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model
Viaarxiv icon

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations

Add code
May 03, 2023
Figure 1 for Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations
Figure 2 for Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations
Figure 3 for Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations
Figure 4 for Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations
Viaarxiv icon

Local Metrics for Multi-Object Tracking

Add code
Apr 06, 2021
Figure 1 for Local Metrics for Multi-Object Tracking
Figure 2 for Local Metrics for Multi-Object Tracking
Figure 3 for Local Metrics for Multi-Object Tracking
Figure 4 for Local Metrics for Multi-Object Tracking
Viaarxiv icon

The surprising impact of mask-head architecture on novel class segmentation

Add code
Apr 01, 2021
Figure 1 for The surprising impact of mask-head architecture on novel class segmentation
Figure 2 for The surprising impact of mask-head architecture on novel class segmentation
Figure 3 for The surprising impact of mask-head architecture on novel class segmentation
Figure 4 for The surprising impact of mask-head architecture on novel class segmentation
Viaarxiv icon

PERF-Net: Pose Empowered RGB-Flow Net

Add code
Sep 28, 2020
Figure 1 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 2 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 3 for PERF-Net: Pose Empowered RGB-Flow Net
Figure 4 for PERF-Net: Pose Empowered RGB-Flow Net
Viaarxiv icon