Hsin-Ying Lee

CrossDTR: Cross-view and Depth-guided Transformers for 3D Object Detection

Oct 12, 2022
Via arXiv

Learning Fine-Grained Visual Understanding for Video Question Answering via Decoupling Spatial-Temporal Modeling

Oct 08, 2022
Via arXiv

Coarse-to-Fine Point Cloud Registration with SE-Equivariant Representations

Oct 05, 2022
Via arXiv

CFVS: Coarse-to-Fine Visual Servoing for 6-DoF Object-Agnostic Peg-In-Hole Assembly

Sep 19, 2022
Via arXiv

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model

Sep 01, 2022
Via arXiv

Vector Quantized Image-to-Image Translation

Jul 27, 2022
Via arXiv

Cross-Modal 3D Shape Generation and Manipulation

Jul 24, 2022
Via arXiv

Unveiling The Mask of Position-Information Pattern Through the Mist of Image Features

Jun 02, 2022
Via arXiv

Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning

Mar 04, 2022
Via arXiv

ADeADA: Adaptive Density-aware Active Domain Adaptation for Semantic Segmentation

Feb 15, 2022
Via arXiv