Picture for Lei Li

Lei Li

Carnegie Mellon University

Revealing the Barriers of Language Agents in Planning

Add code
Oct 16, 2024
Figure 1 for Revealing the Barriers of Language Agents in Planning
Figure 2 for Revealing the Barriers of Language Agents in Planning
Figure 3 for Revealing the Barriers of Language Agents in Planning
Figure 4 for Revealing the Barriers of Language Agents in Planning
Viaarxiv icon

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks

Add code
Oct 16, 2024
Viaarxiv icon

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Add code
Oct 15, 2024
Figure 1 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 2 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 3 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 4 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Viaarxiv icon

TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs

Add code
Oct 14, 2024
Figure 1 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 2 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 3 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 4 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Viaarxiv icon

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Add code
Oct 12, 2024
Figure 1 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 2 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 3 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 4 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Viaarxiv icon

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Add code
Oct 10, 2024
Figure 1 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 2 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 3 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 4 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Viaarxiv icon

Temporal Reasoning Transfer from Text to Video

Add code
Oct 08, 2024
Figure 1 for Temporal Reasoning Transfer from Text to Video
Figure 2 for Temporal Reasoning Transfer from Text to Video
Figure 3 for Temporal Reasoning Transfer from Text to Video
Figure 4 for Temporal Reasoning Transfer from Text to Video
Viaarxiv icon

Translation Canvas: An Explainable Interface to Pinpoint and Analyze Translation Systems

Add code
Oct 07, 2024
Viaarxiv icon

CAR: Controllable Autoregressive Modeling for Visual Generation

Add code
Oct 07, 2024
Viaarxiv icon

Adaptive Masking Enhances Visual Grounding

Add code
Oct 04, 2024
Figure 1 for Adaptive Masking Enhances Visual Grounding
Figure 2 for Adaptive Masking Enhances Visual Grounding
Figure 3 for Adaptive Masking Enhances Visual Grounding
Figure 4 for Adaptive Masking Enhances Visual Grounding
Viaarxiv icon