Alert button
Picture for Zixu Zhao

Zixu Zhao

Alert button

PeFoMed: Parameter Efficient Fine-tuning on Multimodal Large Language Models for Medical Visual Question Answering

Add code
Bookmark button
Alert button
Jan 05, 2024
Jinlong He, Pengfei Li, Gang Liu, Zixu Zhao, Shenjun Zhong

Viaarxiv icon

Unsupervised Open-Vocabulary Object Localization in Videos

Add code
Bookmark button
Alert button
Sep 18, 2023
Ke Fan, Zechen Bai, Tianjun Xiao, Dominik Zietlow, Max Horn, Zixu Zhao, Carl-Johann Simon-Gabriel, Mike Zheng Shou, Francesco Locatello, Bernt Schiele, Thomas Brox, Zheng Zhang, Yanwei Fu, Tong He

Figure 1 for Unsupervised Open-Vocabulary Object Localization in Videos
Figure 2 for Unsupervised Open-Vocabulary Object Localization in Videos
Figure 3 for Unsupervised Open-Vocabulary Object Localization in Videos
Figure 4 for Unsupervised Open-Vocabulary Object Localization in Videos
Viaarxiv icon

Object-Centric Multiple Object Tracking

Add code
Bookmark button
Alert button
Sep 05, 2023
Zixu Zhao, Jiaze Wang, Max Horn, Yizhuo Ding, Tong He, Zechen Bai, Dominik Zietlow, Carl-Johann Simon-Gabriel, Bing Shuai, Zhuowen Tu, Thomas Brox, Bernt Schiele, Yanwei Fu, Francesco Locatello, Zheng Zhang, Tianjun Xiao

Figure 1 for Object-Centric Multiple Object Tracking
Figure 2 for Object-Centric Multiple Object Tracking
Figure 3 for Object-Centric Multiple Object Tracking
Figure 4 for Object-Centric Multiple Object Tracking
Viaarxiv icon

Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering

Add code
Bookmark button
Alert button
Jul 11, 2023
Pengfei Li, Gang Liu, Jinlong He, Zixu Zhao, Shenjun Zhong

Figure 1 for Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Figure 2 for Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Figure 3 for Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Figure 4 for Masked Vision and Language Pre-training with Unimodal and Multimodal Contrastive Losses for Medical Visual Question Answering
Viaarxiv icon

PointPatchMix: Point Cloud Mixing with Patch Scoring

Add code
Bookmark button
Alert button
Mar 12, 2023
Yi Wang, Jiaze Wang, Jinpeng Li, Zixu Zhao, Guangyong Chen, Anfeng Liu, Pheng-Ann Heng

Figure 1 for PointPatchMix: Point Cloud Mixing with Patch Scoring
Figure 2 for PointPatchMix: Point Cloud Mixing with Patch Scoring
Figure 3 for PointPatchMix: Point Cloud Mixing with Patch Scoring
Figure 4 for PointPatchMix: Point Cloud Mixing with Patch Scoring
Viaarxiv icon

Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical Scene Segmentation with Limited Annotations

Add code
Bookmark button
Alert button
Jul 20, 2022
Yang Yu, Zixu Zhao, Yueming Jin, Guangyong Chen, Qi Dou, Pheng-Ann Heng

Figure 1 for Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical Scene Segmentation with Limited Annotations
Figure 2 for Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical Scene Segmentation with Limited Annotations
Figure 3 for Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical Scene Segmentation with Limited Annotations
Figure 4 for Pseudo-label Guided Cross-video Pixel Contrast for Robotic Surgical Scene Segmentation with Limited Annotations
Viaarxiv icon

Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation

Add code
Bookmark button
Alert button
Mar 29, 2022
Yueming Jin, Yang Yu, Cheng Chen, Zixu Zhao, Pheng-Ann Heng, Danail Stoyanov

Figure 1 for Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Figure 2 for Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Figure 3 for Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Figure 4 for Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation
Viaarxiv icon

TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery

Add code
Bookmark button
Alert button
Feb 17, 2022
Zixu Zhao, Yueming Jin, Pheng-Ann Heng

Figure 1 for TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Figure 2 for TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Figure 3 for TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Figure 4 for TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
Viaarxiv icon

Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning

Add code
Bookmark button
Alert button
Sep 28, 2021
Zixu Zhao, Yueming Jin, Pheng-Ann Heng

Figure 1 for Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning
Figure 2 for Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning
Figure 3 for Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning
Figure 4 for Modelling Neighbor Relation in Joint Space-Time Graph for Video Correspondence Learning
Viaarxiv icon