Wei Zhang

Alibaba Group

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Mar 18, 2024
Via arXiv

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation

Mar 18, 2024
Via arXiv

TRELM: Towards Robust and Efficient Pre-training for Knowledge-Enhanced Language Models

Mar 17, 2024
Via arXiv

Affective Behaviour Analysis via Integrating Multi-Modal Knowledge

Mar 16, 2024
Via arXiv

Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors

Mar 14, 2024
Via arXiv

OneVOS: Unifying Video Object Segmentation with All-in-One Transformer Framework

Mar 13, 2024
Via arXiv

Query-guided Prototype Evolution Network for Few-Shot Segmentation

Mar 11, 2024
Via arXiv

ClickVOS: Click Video Object Segmentation

Mar 10, 2024
Via arXiv

Aligning Large Language Models for Controllable Recommendations

Mar 08, 2024
Via arXiv

Popeye: A Unified Visual-Language Model for Multi-Source Ship Detection from Remote Sensing Imagery

Mar 06, 2024
Via arXiv