Alert button
Picture for Hongshen Xu

Hongshen Xu

Alert button

Rejection Improves Reliability: Training LLMs to Refuse Unknown Questions Using RL from Knowledge Feedback

Add code
Bookmark button
Alert button
Apr 07, 2024
Hongshen Xu, Zichen Zhu, Situo Zhang, Da Ma, Shuai Fan, Lu Chen, Kai Yu

Viaarxiv icon

Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind

Add code
Bookmark button
Alert button
Apr 06, 2024
Hongchuan Zeng, Hongshen Xu, Lu Chen, Kai Yu

Viaarxiv icon

Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding

Add code
Bookmark button
Alert button
Feb 28, 2024
Hongshen Xu, Lu Chen, Zihan Zhao, Da Ma, Ruisheng Cao, Zichen Zhu, Kai Yu

Viaarxiv icon

A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames

Add code
Bookmark button
Alert button
Feb 28, 2024
Hongshen Xu, Ruisheng Cao, Su Zhu, Sheng Jiang, Hanchong Zhang, Lu Chen, Kai Yu

Viaarxiv icon

ChemDFM: Dialogue Foundation Model for Chemistry

Add code
Bookmark button
Alert button
Jan 26, 2024
Zihan Zhao, Da Ma, Lu Chen, Liangtai Sun, Zihao Li, Hongshen Xu, Zichen Zhu, Su Zhu, Shuai Fan, Guodong Shen, Xin Chen, Kai Yu

Viaarxiv icon

ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL

Add code
Bookmark button
Alert button
Oct 28, 2023
Ruisheng Cao, Hanchong Zhang, Hongshen Xu, Jieyu Li, Da Ma, Lu Chen, Kai Yu

Figure 1 for ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL
Figure 2 for ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL
Figure 3 for ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL
Figure 4 for ASTormer: An AST Structure-aware Transformer Decoder for Text-to-SQL
Viaarxiv icon

ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought

Add code
Bookmark button
Alert button
Oct 26, 2023
Hanchong Zhang, Ruisheng Cao, Lu Chen, Hongshen Xu, Kai Yu

Viaarxiv icon

Large Language Model Is Semi-Parametric Reinforcement Learning Agent

Add code
Bookmark button
Alert button
Jun 09, 2023
Danyang Zhang, Lu Chen, Situo Zhang, Hongshen Xu, Zihan Zhao, Kai Yu

Figure 1 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 2 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 3 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Figure 4 for Large Language Model Is Semi-Parametric Reinforcement Learning Agent
Viaarxiv icon

On the Structural Generalization in Text-to-SQL

Add code
Bookmark button
Alert button
Jan 21, 2023
Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, Kai Yu

Figure 1 for On the Structural Generalization in Text-to-SQL
Figure 2 for On the Structural Generalization in Text-to-SQL
Figure 3 for On the Structural Generalization in Text-to-SQL
Figure 4 for On the Structural Generalization in Text-to-SQL
Viaarxiv icon