Can Huang

MetaSQL: A Generate-then-Rank Framework for Natural Language to SQL Translation

Feb 27, 2024
Yuankai Fan, Zhenying He, Tonghui Ren, Can Huang, Yinan Jing, Kai Zhang, X. Sean Wang

Via arXiv

PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition

Feb 15, 2024
Jinghui Lu, Ziwei Yang, Yanjie Wang, Xuejing Liu, Brian Mac Namee, Can Huang

Via arXiv

GloTSFormer: Global Video Text Spotting Transformer

Jan 08, 2024
Han Wang, Yanjie Wang, Yang Li, Can Huang

Via arXiv

DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding

Nov 30, 2023
Hao Feng, Qi Liu, Hao Liu, Wengang Zhou, Houqiang Li, Can Huang

Via arXiv

Multi-modal In-Context Learning Makes an Ego-evolving Scene Text Recognizer

Nov 23, 2023
Zhen Zhao, Jingqun Tang, Chunhui Lin, Binghong Wu, Hao Liu, Zhizhong Zhang, Xin Tan, Can Huang, Yuan Xie

Via arXiv

UniDoc: A Universal Large Multimodal Model for Simultaneous Text Detection, Recognition, Spotting and Understanding

Sep 02, 2023
Hao Feng, Zijian Wang, Jingqun Tang, Jinghui Lu, Wengang Zhou, Houqiang Li, Can Huang

Via arXiv

ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

Aug 20, 2023
Mingxin Huang, Jiaxin Zhang, Dezhi Peng, Hao Lu, Can Huang, Yuliang Liu, Xiang Bai, Lianwen Jin

Via arXiv