Xiang Bai

Bridging the Gap Between End-to-End and Two-Step Text Spotting

Apr 06, 2024
Mingxin Huang, Hongliang Li, Yuliang Liu, Xiang Bai, Lianwen Jin

Via arXiv

SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Apr 04, 2024
Zijie Wu, Chaohui Yu, Yanqin Jiang, Chenjie Cao, Fan Wang, Xiang Bai

Via arXiv

OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition

Mar 28, 2024
Jianqiang Wan, Sibo Song, Wenwen Yu, Yuliang Liu, Wenqing Cheng, Fei Huang, Xiang Bai, Cong Yao, Zhibo Yang

Via arXiv

PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model

Mar 21, 2024
Zheng Zhang, Yeyao Ma, Enming Zhang, Xiang Bai

Via arXiv

TextMonkey: An OCR-Free Large Multimodal Model for Understanding Document

Mar 15, 2024
Yuliang Liu, Biao Yang, Qiang Liu, Zhang Li, Zhiyin Ma, Shuo Zhang, Xiang Bai

Via arXiv

Anomaly Detection by Adapting a pre-trained Vision Language Model

Mar 14, 2024
Yuxuan Cai, Xinwei He, Dingkang Liang, Ao Tong, Xiang Bai

Via arXiv

Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Mar 03, 2024
Xin Zhou, Dingkang Liang, Wei Xu, Xingkui Zhu, Yihan Xu, Zhikang Zou, Xiang Bai

Via arXiv

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

Feb 24, 2024
Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai

Via arXiv

Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition

Feb 21, 2024
Mingkun Yang, Biao Yang, Minghui Liao, Yingying Zhu, Xiang Bai

Via arXiv