Picture for Dan Roth

Dan Roth

Shammie

A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners

Add code
Jun 16, 2024
Figure 1 for A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Figure 2 for A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Figure 3 for A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Figure 4 for A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
Viaarxiv icon

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Add code
Jun 13, 2024
Figure 1 for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Figure 2 for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Figure 3 for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Figure 4 for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models
Viaarxiv icon

MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding

Add code
Jun 13, 2024
Figure 1 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 2 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 3 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Figure 4 for MuirBench: A Comprehensive Benchmark for Robust Multi-image Understanding
Viaarxiv icon

Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

Add code
Jun 11, 2024
Figure 1 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 2 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 3 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Figure 4 for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?
Viaarxiv icon

Devil's Advocate: Anticipatory Reflection for LLM Agents

Add code
May 29, 2024
Figure 1 for Devil's Advocate: Anticipatory Reflection for LLM Agents
Figure 2 for Devil's Advocate: Anticipatory Reflection for LLM Agents
Figure 3 for Devil's Advocate: Anticipatory Reflection for LLM Agents
Figure 4 for Devil's Advocate: Anticipatory Reflection for LLM Agents
Viaarxiv icon

ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models

Add code
May 28, 2024
Viaarxiv icon

BLINK: Multimodal Large Language Models Can See but Not Perceive

Add code
Apr 18, 2024
Figure 1 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 2 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 3 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Figure 4 for BLINK: Multimodal Large Language Models Can See but Not Perceive
Viaarxiv icon

BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models

Add code
Apr 18, 2024
Figure 1 for BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Figure 2 for BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Figure 3 for BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Figure 4 for BIRD: A Trustworthy Bayesian Inference Framework for Large Language Models
Viaarxiv icon

Fewer Truncations Improve Language Modeling

Add code
Apr 16, 2024
Figure 1 for Fewer Truncations Improve Language Modeling
Figure 2 for Fewer Truncations Improve Language Modeling
Figure 3 for Fewer Truncations Improve Language Modeling
Figure 4 for Fewer Truncations Improve Language Modeling
Viaarxiv icon

Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval

Add code
Apr 15, 2024
Figure 1 for Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval
Figure 2 for Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval
Figure 3 for Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval
Figure 4 for Is Table Retrieval a Solved Problem? Join-Aware Multi-Table Retrieval
Viaarxiv icon