Alert button
Picture for Hao Sun

Hao Sun

Alert button

Towards Verifiable Text Generation with Evolving Memory and Self-Reflection

Add code
Bookmark button
Alert button
Dec 14, 2023
Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin

Figure 1 for Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Figure 2 for Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Figure 3 for Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Figure 4 for Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
Viaarxiv icon

Unveiling the Implicit Toxicity in Large Language Models

Add code
Bookmark button
Alert button
Nov 29, 2023
Jiaxin Wen, Pei Ke, Hao Sun, Zhexin Zhang, Chengfei Li, Jinfeng Bai, Minlie Huang

Viaarxiv icon

Robust Domain Misinformation Detection via Multi-modal Feature Alignment

Add code
Bookmark button
Alert button
Nov 24, 2023
Hui Liu, Wenya Wang, Hao Sun, Anderson Rocha, Haoliang Li

Viaarxiv icon

When is Off-Policy Evaluation Useful? A Data-Centric Perspective

Add code
Bookmark button
Alert button
Nov 23, 2023
Hao Sun, Alex J. Chan, Nabeel Seedat, Alihan Hüyük, Mihaela van der Schaar

Figure 1 for When is Off-Policy Evaluation Useful? A Data-Centric Perspective
Figure 2 for When is Off-Policy Evaluation Useful? A Data-Centric Perspective
Figure 3 for When is Off-Policy Evaluation Useful? A Data-Centric Perspective
Figure 4 for When is Off-Policy Evaluation Useful? A Data-Centric Perspective
Viaarxiv icon

AI-accelerated Discovery of Altermagnetic Materials

Add code
Bookmark button
Alert button
Nov 13, 2023
Ze-Feng Gao, Shuai Qu, Bocheng Zeng, Yang Liu, Ji-Rong Wen, Hao Sun, Peng-Jie Guo, Zhong-Yi Lu

Figure 1 for AI-accelerated Discovery of Altermagnetic Materials
Figure 2 for AI-accelerated Discovery of Altermagnetic Materials
Figure 3 for AI-accelerated Discovery of Altermagnetic Materials
Figure 4 for AI-accelerated Discovery of Altermagnetic Materials
Viaarxiv icon

Character-level Chinese Backpack Language Models

Add code
Bookmark button
Alert button
Oct 19, 2023
Hao Sun, John Hewitt

Figure 1 for Character-level Chinese Backpack Language Models
Figure 2 for Character-level Chinese Backpack Language Models
Figure 3 for Character-level Chinese Backpack Language Models
Figure 4 for Character-level Chinese Backpack Language Models
Viaarxiv icon

Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples

Add code
Bookmark button
Alert button
Oct 11, 2023
Hao Sun, Alihan Hüyük, Daniel Jarrett, Mihaela van der Schaar

Figure 1 for Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Figure 2 for Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Figure 3 for Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Figure 4 for Accountability in Offline Reinforcement Learning: Explaining Decisions with a Corpus of Examples
Viaarxiv icon

Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond

Add code
Bookmark button
Alert button
Oct 09, 2023
Hao Sun

Figure 1 for Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Figure 2 for Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Figure 3 for Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Figure 4 for Reinforcement Learning in the Era of LLMs: What is Essential? What is needed? An RL Perspective on RLHF, Prompting, and Beyond
Viaarxiv icon

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL

Add code
Bookmark button
Alert button
Sep 29, 2023
Hao Sun, Alihan Hüyük, Mihaela van der Schaar

Figure 1 for Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Figure 2 for Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Figure 3 for Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Figure 4 for Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Viaarxiv icon