Alert button
Picture for Hang Xu

Hang Xu

Alert button

DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection

Add code
Bookmark button
Alert button
Apr 14, 2024
Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu

Viaarxiv icon

Self-Adaptive Reality-Guided Diffusion for Artifact-Free Super-Resolution

Add code
Bookmark button
Alert button
Mar 25, 2024
Qingping Zheng, Ling Zheng, Yuanfan Guo, Ying Li, Songcen Xu, Jiankang Deng, Hang Xu

Viaarxiv icon

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Add code
Bookmark button
Alert button
Mar 22, 2024
Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang

Figure 1 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 2 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 3 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 4 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Viaarxiv icon

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Add code
Bookmark button
Alert button
Mar 18, 2024
Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu

Figure 1 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 2 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 3 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Figure 4 for LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
Viaarxiv icon

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation

Add code
Bookmark button
Alert button
Mar 18, 2024
Haochen Jiang, Yueming Xu, Yihan Zeng, Hang Xu, Wei Zhang, Jianfeng Feng, Li Zhang

Figure 1 for OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
Figure 2 for OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
Figure 3 for OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
Figure 4 for OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation
Viaarxiv icon

NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

Add code
Bookmark button
Alert button
Mar 12, 2024
Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang

Figure 1 for NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Figure 2 for NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Figure 3 for NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Figure 4 for NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
Viaarxiv icon

From Summary to Action: Enhancing Large Language Models for Complex Tasks with Open World APIs

Add code
Bookmark button
Alert button
Feb 28, 2024
Yulong Liu, Yunlong Yuan, Chunwei Wang, Jianhua Han, Yongqiang Ma, Li Zhang, Nanning Zheng, Hang Xu

Viaarxiv icon

Optimal Parallelization Strategies for Active Flow Control in Deep Reinforcement Learning-Based Computational Fluid Dynamics

Add code
Bookmark button
Alert button
Feb 18, 2024
Wang Jia, Hang Xu

Viaarxiv icon

GS-CLIP: Gaussian Splatting for Contrastive Language-Image-3D Pretraining from Real-World Data

Add code
Bookmark button
Alert button
Feb 13, 2024
Haoyuan Li, Yanpeng Zhou, Yihan Zeng, Hang Xu, Xiaodan Liang

Viaarxiv icon