Picture for Zhiguo Wang

Zhiguo Wang

Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall

Add code
Apr 24, 2024
Figure 1 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 2 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 3 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 4 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Viaarxiv icon

Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks

Add code
Jan 31, 2024
Viaarxiv icon

Globally Optimal Beamforming Design for Integrated Sensing and Communication Systems

Add code
Sep 13, 2023
Figure 1 for Globally Optimal Beamforming Design for Integrated Sensing and Communication Systems
Figure 2 for Globally Optimal Beamforming Design for Integrated Sensing and Communication Systems
Viaarxiv icon

Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning

Add code
Aug 10, 2023
Figure 1 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 2 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 3 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 4 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Viaarxiv icon

NIPD: A Federated Learning Person Detection Benchmark Based on Real-World Non-IID Data

Add code
Jun 28, 2023
Figure 1 for NIPD: A Federated Learning Person Detection Benchmark Based on Real-World Non-IID Data
Figure 2 for NIPD: A Federated Learning Person Detection Benchmark Based on Real-World Non-IID Data
Figure 3 for NIPD: A Federated Learning Person Detection Benchmark Based on Real-World Non-IID Data
Figure 4 for NIPD: A Federated Learning Person Detection Benchmark Based on Real-World Non-IID Data
Viaarxiv icon

XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations

Add code
Jun 07, 2023
Figure 1 for XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
Figure 2 for XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
Figure 3 for XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
Figure 4 for XSemPLR: Cross-Lingual Semantic Parsing in Multiple Natural Languages and Meaning Representations
Viaarxiv icon

Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge

Add code
May 30, 2023
Figure 1 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 2 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 3 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 4 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Viaarxiv icon

Benchmarking Diverse-Modal Entity Linking with Generative Models

Add code
May 27, 2023
Figure 1 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 2 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 3 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 4 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Viaarxiv icon

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

Add code
May 26, 2023
Figure 1 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 2 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 3 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Viaarxiv icon

Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness

Add code
Jan 21, 2023
Figure 1 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 2 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 3 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 4 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Viaarxiv icon