Linli Xu

HRVDA: High-Resolution Visual Document Assistant

Apr 10, 2024
Chaohu Liu, Kun Yin, Haoyu Cao, Xinghua Jiang, Xin Li, Yinsong Liu, Deqiang Jiang, Xing Sun, Linli Xu

Communication-Efficient Distributed Learning with Local Immediate Error Compensation

Feb 19, 2024
Yifei Cheng, Li Shen, Linli Xu, Xun Qian, Shiwei Wu, Yiming Zhou, Tie Zhang, Dacheng Tao, Enhong Chen

Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks

Jan 18, 2024
Yichao Du, Zhirui Zhang, Linan Yue, Xu Huang, Yuqing Zhang, Tong Xu, Linli Xu, Enhong Chen

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

Oct 26, 2023
Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu

Multi-Grained Multimodal Interaction Network for Entity Linking

Jul 19, 2023
Pengfei Luo, Tong Xu, Shiwei Wu, Chen Zhu, Linli Xu, Enhong Chen

End-to-End Word-Level Pronunciation Assessment with MASK Pre-training

Jun 05, 2023
Yukang Liang, Kaitao Song, Shaoguang Mao, Huiqiang Jiang, Luna Qiu, Yuqing Yang, Dongsheng Li, Linli Xu, Lili Qiu

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA

Apr 04, 2023
Yongxin Zhu, Zhen Liu, Yukang Liang, Xin Li, Hao Liu, Changcun Bao, Linli Xu

Difformer: Empowering Diffusion Model on Embedding Space for Text Generation

Dec 19, 2022
Zhujin Gao, Junliang Guo, Xu Tan, Yongxin Zhu, Fang Zhang, Jiang Bian, Linli Xu

Bridging Music and Text with Crowdsourced Music Comments: A Sequence-to-Sequence Framework for Thematic Music Comments Generation

Sep 05, 2022
Peining Zhang, Junliang Guo, Linli Xu, Mu You, Junming Yin

Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation

May 22, 2022
Jiquan Li, Junliang Guo, Yongxin Zhu, Xin Sheng, Deqiang Jiang, Bo Ren, Linli Xu
