Shaoteng Liu

Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models

Mar 27, 2024
Yanwei Li, Yuechen Zhang, Chengyao Wang, Zhisheng Zhong, Yixin Chen, Ruihang Chu, Shaoteng Liu, Jiaya Jia

Via arXiv

RL-GPT: Integrating Reinforcement Learning and Code-as-policy

Feb 29, 2024
Shaoteng Liu, Haoqi Yuan, Minda Hu, Yanwei Li, Yukang Chen, Shu Liu, Zongqing Lu, Jiaya Jia

Via arXiv

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Oct 19, 2023
Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu

Via arXiv

Self-supervised Learning by View Synthesis

Apr 22, 2023
Shaoteng Liu, Xiangyu Zhang, Tao Hu, Jiaya Jia

Via arXiv

Video-P2P: Video Editing with Cross-attention Control

Mar 08, 2023
Shaoteng Liu, Yuechen Zhang, Wenbo Li, Zhe Lin, Jiaya Jia

Via arXiv

Generative Model Watermarking Based on Human Visual System

Sep 30, 2022
Li Zhang, Yong Liu, Shaoteng Liu, Tianshu Yang, Yexin Wang, Xinpeng Zhang, Hanzhou Wu

Via arXiv

On-target Adaptation

Sep 02, 2021
Dequan Wang, Shaoteng Liu, Sayna Ebrahimi, Evan Shelhamer, Trevor Darrell

Via arXiv

Multi-modal Cooking Workflow Construction for Food Recipes

Aug 20, 2020
Liangming Pan, Jingjing Chen, Jianlong Wu, Shaoteng Liu, Chong-Wah Ngo, Min-Yen Kan, Yu-Gang Jiang, Tat-Seng Chua

Via arXiv

GREEN: a Graph REsidual rE-ranking Network for Grading Diabetic Retinopathy

Jul 21, 2020
Shaoteng Liu, Lijun Gong, Kai Ma, Yefeng Zheng

Via arXiv