Picture for Patrick Ng

Patrick Ng

Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall

Add code
Apr 24, 2024
Figure 1 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 2 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 3 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Figure 4 for Towards a Holistic Evaluation of LLMs on Factual Knowledge Recall
Viaarxiv icon

Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks

Add code
Jan 31, 2024
Viaarxiv icon

Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning

Add code
Aug 10, 2023
Figure 1 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 2 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 3 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Figure 4 for Few-Shot Data-to-Text Generation via Unified Representation and Multi-Source Learning
Viaarxiv icon

Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge

Add code
May 30, 2023
Figure 1 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 2 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 3 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Figure 4 for Generate then Select: Open-ended Visual Question Answering Guided by World Knowledge
Viaarxiv icon

Benchmarking Diverse-Modal Entity Linking with Generative Models

Add code
May 27, 2023
Figure 1 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 2 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 3 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Figure 4 for Benchmarking Diverse-Modal Entity Linking with Generative Models
Viaarxiv icon

UNITE: A Unified Benchmark for Text-to-SQL Evaluation

Add code
May 26, 2023
Figure 1 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 2 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Figure 3 for UNITE: A Unified Benchmark for Text-to-SQL Evaluation
Viaarxiv icon

Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness

Add code
Jan 21, 2023
Figure 1 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 2 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 3 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Figure 4 for Dr.Spider: A Diagnostic Evaluation Benchmark towards Text-to-SQL Robustness
Viaarxiv icon

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

Add code
Dec 17, 2022
Figure 1 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 2 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 3 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Figure 4 for Importance of Synthesizing High-quality Data for Text-to-SQL Parsing
Viaarxiv icon

Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations

Add code
Dec 17, 2022
Figure 1 for Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
Figure 2 for Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
Figure 3 for Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
Figure 4 for Improving Cross-task Generalization of Unified Table-to-text Models with Compositional Task Configurations
Viaarxiv icon

DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases

Add code
Sep 30, 2022
Figure 1 for DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
Figure 2 for DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
Figure 3 for DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
Figure 4 for DecAF: Joint Decoding of Answers and Logical Forms for Question Answering over Knowledge Bases
Viaarxiv icon