Alert button
Picture for Lei Hou

Lei Hou

Alert button

Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions

Add code
Bookmark button
Alert button
Nov 23, 2023
Shulin Cao, Jiajie Zhang, Jiaxin Shi, Xin Lv, Zijun Yao, Qi Tian, Juanzi Li, Lei Hou

Figure 1 for Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
Figure 2 for Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
Figure 3 for Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
Figure 4 for Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions
Viaarxiv icon

MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation

Add code
Bookmark button
Alert button
Nov 15, 2023
Xiaozhi Wang, Hao Peng, Yong Guan, Kaisheng Zeng, Jianhui Chen, Lei Hou, Xu Han, Yankai Lin, Zhiyuan Liu, Ruobing Xie, Jie Zhou, Juanzi Li

Figure 1 for MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Figure 2 for MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Figure 3 for MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Figure 4 for MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation
Viaarxiv icon

When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks

Add code
Bookmark button
Alert button
Nov 15, 2023
Hao Peng, Xiaozhi Wang, Jianhui Chen, Weikai Li, Yunjia Qi, Zimu Wang, Zhili Wu, Kaisheng Zeng, Bin Xu, Lei Hou, Juanzi Li

Figure 1 for When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Figure 2 for When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Figure 3 for When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Figure 4 for When does In-context Learning Fall Short and Why? A Study on Specification-Heavy Tasks
Viaarxiv icon

WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models

Add code
Bookmark button
Alert button
Nov 13, 2023
Shangqing Tu, Yuliang Sun, Yushi Bai, Jifan Yu, Lei Hou, Juanzi Li

Figure 1 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 2 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 3 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Figure 4 for WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
Viaarxiv icon

Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach

Add code
Bookmark button
Alert button
Oct 18, 2023
Zheyuan Zhang, Jifan Yu, Juanzi Li, Lei Hou

Figure 1 for Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach
Figure 2 for Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach
Figure 3 for Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach
Figure 4 for Exploring the Cognitive Knowledge Structure of Large Language Models: An Educational Diagnostic Assessment Approach
Viaarxiv icon

Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment

Add code
Bookmark button
Alert button
Oct 16, 2023
Ji Qi, Kaixuan Ji, Xiaozhi Wang, Jifan Yu, Kaisheng Zeng, Lei Hou, Juanzi Li, Bin Xu

Figure 1 for Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment
Figure 2 for Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment
Figure 3 for Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment
Figure 4 for Mastering the Task of Open Information Extraction with Large Language Models and Consistent Reasoning Environment
Viaarxiv icon

BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation

Add code
Bookmark button
Alert button
Oct 16, 2023
Ji Qi, Kaixuan Ji, Jifan Yu, Duokang Wang, Bin Xu, Lei Hou, Juanzi Li

Figure 1 for BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation
Figure 2 for BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation
Figure 3 for BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation
Figure 4 for BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation
Viaarxiv icon

OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding

Add code
Bookmark button
Alert button
Sep 25, 2023
Hao Peng, Xiaozhi Wang, Feng Yao, Zimu Wang, Chuzhao Zhu, Kaisheng Zeng, Lei Hou, Juanzi Li

Figure 1 for OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
Figure 2 for OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
Figure 3 for OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
Figure 4 for OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding
Viaarxiv icon

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Add code
Bookmark button
Alert button
Aug 28, 2023
Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li

Figure 1 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 2 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 3 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 4 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Viaarxiv icon