Picture for Zhedong Zheng

Zhedong Zheng

Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene

Add code
Jul 11, 2024
Viaarxiv icon

From Data Deluge to Data Curation: A Filtering-WoRA Paradigm for Efficient Text-based Person Search

Add code
Apr 16, 2024
Viaarxiv icon

Instilling Multi-round Thinking to Text-guided Image Generation

Add code
Jan 16, 2024
Viaarxiv icon

Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching

Add code
Nov 21, 2023
Viaarxiv icon

Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation

Add code
Nov 21, 2023
Figure 1 for Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation
Figure 2 for Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation
Figure 3 for Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation
Figure 4 for Transferring to Real-World Layouts: A Depth-aware Framework for Scene Adaptation
Viaarxiv icon

Progressive Text-to-3D Generation for Automatic 3D Prototyping

Add code
Sep 26, 2023
Figure 1 for Progressive Text-to-3D Generation for Automatic 3D Prototyping
Figure 2 for Progressive Text-to-3D Generation for Automatic 3D Prototyping
Figure 3 for Progressive Text-to-3D Generation for Automatic 3D Prototyping
Figure 4 for Progressive Text-to-3D Generation for Automatic 3D Prototyping
Viaarxiv icon

Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark

Add code
Jun 06, 2023
Figure 1 for Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Figure 2 for Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Figure 3 for Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Figure 4 for Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark
Viaarxiv icon

Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval

Add code
Jun 03, 2023
Figure 1 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 2 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 3 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Figure 4 for Relieving Triplet Ambiguity: Consensus Network for Language-Guided Image Retrieval
Viaarxiv icon

Actively Discovering New Slots for Task-oriented Conversation

Add code
May 06, 2023
Figure 1 for Actively Discovering New Slots for Task-oriented Conversation
Figure 2 for Actively Discovering New Slots for Task-oriented Conversation
Figure 3 for Actively Discovering New Slots for Task-oriented Conversation
Figure 4 for Actively Discovering New Slots for Task-oriented Conversation
Viaarxiv icon

Learnable Pillar-based Re-ranking for Image-Text Retrieval

Add code
Apr 25, 2023
Figure 1 for Learnable Pillar-based Re-ranking for Image-Text Retrieval
Figure 2 for Learnable Pillar-based Re-ranking for Image-Text Retrieval
Figure 3 for Learnable Pillar-based Re-ranking for Image-Text Retrieval
Figure 4 for Learnable Pillar-based Re-ranking for Image-Text Retrieval
Viaarxiv icon