Picture for Kaixin Ma

Kaixin Ma

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems

Add code
Jul 15, 2024
Viaarxiv icon

MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning

Add code
Apr 24, 2024
Viaarxiv icon

SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense

Add code
Apr 22, 2024
Viaarxiv icon

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Add code
Jan 28, 2024
Viaarxiv icon

Dense X Retrieval: What Retrieval Granularity Should We Use?

Add code
Dec 12, 2023
Viaarxiv icon

Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models

Add code
Nov 15, 2023
Viaarxiv icon

BRAINTEASER: Lateral Thinking Puzzles for Large Language Models

Add code
Oct 10, 2023
Figure 1 for BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Figure 2 for BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Figure 3 for BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Figure 4 for BRAINTEASER: Lateral Thinking Puzzles for Large Language Models
Viaarxiv icon

LASER: LLM Agent with State-Space Exploration for Web Navigation

Add code
Sep 15, 2023
Viaarxiv icon

A Study of Situational Reasoning for Traffic Understanding

Add code
Jun 05, 2023
Figure 1 for A Study of Situational Reasoning for Traffic Understanding
Figure 2 for A Study of Situational Reasoning for Traffic Understanding
Figure 3 for A Study of Situational Reasoning for Traffic Understanding
Figure 4 for A Study of Situational Reasoning for Traffic Understanding
Viaarxiv icon

Knowledge-enhanced Agents for Interactive Text Games

Add code
May 08, 2023
Figure 1 for Knowledge-enhanced Agents for Interactive Text Games
Figure 2 for Knowledge-enhanced Agents for Interactive Text Games
Figure 3 for Knowledge-enhanced Agents for Interactive Text Games
Figure 4 for Knowledge-enhanced Agents for Interactive Text Games
Viaarxiv icon