Picture for Hongsheng Li

Hongsheng Li

FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow

Add code
Jun 08, 2023
Viaarxiv icon

Context-TAP: Tracking Any Point Demands Spatial Context Features

Add code
Jun 03, 2023
Figure 1 for Context-TAP: Tracking Any Point Demands Spatial Context Features
Figure 2 for Context-TAP: Tracking Any Point Demands Spatial Context Features
Figure 3 for Context-TAP: Tracking Any Point Demands Spatial Context Features
Figure 4 for Context-TAP: Tracking Any Point Demands Spatial Context Features
Viaarxiv icon

A Unified Conditional Framework for Diffusion-based Image Restoration

Add code
May 31, 2023
Figure 1 for A Unified Conditional Framework for Diffusion-based Image Restoration
Figure 2 for A Unified Conditional Framework for Diffusion-based Image Restoration
Figure 3 for A Unified Conditional Framework for Diffusion-based Image Restoration
Figure 4 for A Unified Conditional Framework for Diffusion-based Image Restoration
Viaarxiv icon

Voxel2Hemodynamics: An End-to-end Deep Learning Method for Predicting Coronary Artery Hemodynamics

Add code
May 30, 2023
Viaarxiv icon

Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising

Add code
May 29, 2023
Figure 1 for Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Figure 2 for Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Figure 3 for Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Figure 4 for Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising
Viaarxiv icon

Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model

Add code
May 24, 2023
Figure 1 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 2 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 3 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Figure 4 for Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model
Viaarxiv icon

ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Add code
May 17, 2023
Figure 1 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 2 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 3 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 4 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Viaarxiv icon

SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification

Add code
May 16, 2023
Viaarxiv icon

Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network

Add code
May 07, 2023
Figure 1 for Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network
Figure 2 for Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network
Figure 3 for Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network
Figure 4 for Segmentation and Vascular Vectorization for Coronary Artery by Geometry-based Cascaded Neural Network
Viaarxiv icon

Personalize Segment Anything Model with One Shot

Add code
May 04, 2023
Viaarxiv icon