
Hai Zhao

Department of Computer Science and Engineering, Shanghai Jiao Tong University; Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University; MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai Jiao Tong University

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling

Aug 15, 2023

Bidirectional Looking with A Novel Double Exponential Moving Average to Adaptive and Non-adaptive Momentum Optimizers

Jul 02, 2023

BatGPT: A Bidirectional Autoregessive Talker from Generative Pre-trained Transformer

Jul 01, 2023

Modeling Hierarchical Reasoning Chains by Linking Discourse Units and Key Phrases for Reading Comprehension

Jun 21, 2023

FSUIE: A Novel Fuzzy Span Mechanism for Universal Information Extraction

Jun 19, 2023

CMMLU: Measuring massive multitask language understanding in Chinese

Jun 15, 2023

Rethinking Masked Language Modeling for Chinese Spelling Correction

May 28, 2023

Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Large Language Models

May 26, 2023

RefGPT: Reference -> Truthful & Customized Dialogues Generation by GPTs and for GPTs

May 25, 2023

Pre-training Multi-party Dialogue Models with Latent Discourse Inference

May 24, 2023