Alert button
Picture for Xinghua Jiang

Xinghua Jiang

Alert button

HRVDA: High-Resolution Visual Document Assistant

Add code
Bookmark button
Alert button
Apr 10, 2024
Chaohu Liu, Kun Yin, Haoyu Cao, Xinghua Jiang, Xin Li, Yinsong Liu, Deqiang Jiang, Xing Sun, Linli Xu

Viaarxiv icon

Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models

Add code
Bookmark button
Alert button
Feb 29, 2024
Xin Li, Yunfei Wu, Xinghua Jiang, Zhihao Guo, Mingming Gong, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun

Viaarxiv icon

AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes

Add code
Bookmark button
Alert button
Aug 25, 2023
Zhaohui Li, Haitao Wang, Xinghua Jiang

Figure 1 for AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Figure 2 for AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Figure 3 for AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Figure 4 for AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes
Viaarxiv icon

OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification

Add code
Bookmark button
Alert button
Jul 04, 2022
Ye Liu, Lingfeng Qiao, Di Yin, Zhuoxuan Jiang, Xinghua Jiang, Deqiang Jiang, Bo Ren

Figure 1 for OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Figure 2 for OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Figure 3 for OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Figure 4 for OS-MSL: One Stage Multimodal Sequential Link Framework for Scene Segmentation and Classification
Viaarxiv icon

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training

Add code
Bookmark button
Alert button
Apr 18, 2022
Hao Liu, Xinghua Jiang, Xin Li, Antai Guo, Deqiang Jiang, Bo Ren

Figure 1 for The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Figure 2 for The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Figure 3 for The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Figure 4 for The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Viaarxiv icon

NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition

Add code
Bookmark button
Alert button
Nov 25, 2021
Hao Liu, Xinghua Jiang, Xin Li, Zhimin Bao, Deqiang Jiang, Bo Ren

Figure 1 for NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
Figure 2 for NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
Figure 3 for NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
Figure 4 for NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition
Viaarxiv icon