Picture for Haotian Wang

Haotian Wang

BeamAggR: Beam Aggregation Reasoning over Multi-source Knowledge for Multi-hop Question Answering

Add code
Jun 28, 2024
Viaarxiv icon

UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions

Add code
Jun 18, 2024
Figure 1 for UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Figure 2 for UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Figure 3 for UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Figure 4 for UBENCH: Benchmarking Uncertainty in Large Language Models with Multiple Choice Questions
Viaarxiv icon

An Information Bottleneck Perspective for Effective Noise Filtering on Retrieval-Augmented Generation

Add code
Jun 03, 2024
Viaarxiv icon

PPA-Game: Characterizing and Learning Competitive Dynamics Among Online Content Creators

Add code
Mar 22, 2024
Viaarxiv icon

A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition

Add code
Mar 07, 2024
Figure 1 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 2 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 3 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Figure 4 for A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
Viaarxiv icon

Composite Active Learning: Towards Multi-Domain Active Learning with Theoretical Guarantees

Add code
Feb 03, 2024
Viaarxiv icon

Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates

Add code
Dec 08, 2023
Figure 1 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 2 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 3 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Figure 4 for Apollo's Oracle: Retrieval-Augmented Reasoning in Multi-Agent Debates
Viaarxiv icon

TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models

Add code
Nov 29, 2023
Figure 1 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 2 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 3 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Figure 4 for TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models
Viaarxiv icon

Trends in Integration of Knowledge and Large Language Models: A Survey and Taxonomy of Methods, Benchmarks, and Applications

Add code
Nov 10, 2023
Viaarxiv icon

A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions

Add code
Nov 09, 2023
Viaarxiv icon