Picture for Jeff Z. Pan

Jeff Z. Pan

An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs

Add code
Feb 27, 2025
Figure 1 for An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs
Figure 2 for An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs
Figure 3 for An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs
Viaarxiv icon

GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation

Add code
Feb 26, 2025
Viaarxiv icon

Evaluating and Improving Graph to Text Generation with Large Language Models

Add code
Jan 24, 2025
Viaarxiv icon

GeAR: Graph-enhanced Agent for Retrieval-augmented Generation

Add code
Dec 24, 2024
Viaarxiv icon

MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge

Add code
Dec 22, 2024
Figure 1 for MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Figure 2 for MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Figure 3 for MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Figure 4 for MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge
Viaarxiv icon

From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle

Add code
Dec 17, 2024
Figure 1 for From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle
Figure 2 for From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle
Figure 3 for From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle
Figure 4 for From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle
Viaarxiv icon

Multi-level Matching Network for Multimodal Entity Linking

Add code
Dec 11, 2024
Viaarxiv icon

Atomic Fact Decomposition Helps Attributed Question Answering

Add code
Oct 22, 2024
Figure 1 for Atomic Fact Decomposition Helps Attributed Question Answering
Figure 2 for Atomic Fact Decomposition Helps Attributed Question Answering
Figure 3 for Atomic Fact Decomposition Helps Attributed Question Answering
Figure 4 for Atomic Fact Decomposition Helps Attributed Question Answering
Viaarxiv icon

MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps

Add code
Oct 18, 2024
Figure 1 for MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps
Figure 2 for MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps
Figure 3 for MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps
Figure 4 for MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps
Viaarxiv icon

Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA

Add code
Oct 08, 2024
Viaarxiv icon