Wei-Shi Zheng

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Aug 07, 2024

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Jul 16, 2024

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

Jul 16, 2024

Human-Centric Transformer for Domain Adaptive Action Recognition

Jul 15, 2024

An Economic Framework for 6-DoF Grasp Detection

Jul 11, 2024

EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

Jun 13, 2024

Grasp as You Say: Language-guided Dexterous Grasp Generation

May 29, 2024

Dexterous Grasp Transformer

Apr 28, 2024

Single-View Scene Point Cloud Human Grasp Generation

Apr 24, 2024

DreamView: Injecting View-specific Text Guidance into Text-to-3D Generation

Apr 09, 2024