Wei-Shi Zheng

Real-to-Sim Grasp: Rethinking the Gap between Simulation and Real World in Grasp Detection

Oct 09, 2024

Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization

Aug 25, 2024

ParGo: Bridging Vision-Language with Partial and Global Views

Aug 23, 2024

PixelFade: Privacy-preserving Person Re-identification with Noise-guided Progressive Replacement

Aug 10, 2024

Loc4Plan: Locating Before Planning for Outdoor Vision and Language Navigation

Aug 09, 2024

SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses

Aug 07, 2024

Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection

Jul 16, 2024

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Jul 16, 2024

Human-Centric Transformer for Domain Adaptive Action Recognition

Jul 15, 2024

An Economic Framework for 6-DoF Grasp Detection

Jul 11, 2024