Picture for Lei Li

Lei Li

Carnegie Mellon University

Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models

Add code
Oct 25, 2024
Figure 1 for Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
Figure 2 for Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
Figure 3 for Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
Figure 4 for Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models
Viaarxiv icon

Why Does the Effective Context Length of LLMs Fall Short?

Add code
Oct 24, 2024
Figure 1 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 2 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 3 for Why Does the Effective Context Length of LLMs Fall Short?
Figure 4 for Why Does the Effective Context Length of LLMs Fall Short?
Viaarxiv icon

CA*: Addressing Evaluation Pitfalls in Computation-Aware Latency for Simultaneous Speech Translation

Add code
Oct 21, 2024
Viaarxiv icon

ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom

Add code
Oct 18, 2024
Figure 1 for ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
Figure 2 for ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
Figure 3 for ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
Figure 4 for ProReason: Multi-Modal Proactive Reasoning with Decoupled Eyesight and Wisdom
Viaarxiv icon

Revealing the Barriers of Language Agents in Planning

Add code
Oct 16, 2024
Figure 1 for Revealing the Barriers of Language Agents in Planning
Figure 2 for Revealing the Barriers of Language Agents in Planning
Figure 3 for Revealing the Barriers of Language Agents in Planning
Figure 4 for Revealing the Barriers of Language Agents in Planning
Viaarxiv icon

Understanding the Role of LLMs in Multimodal Evaluation Benchmarks

Add code
Oct 16, 2024
Viaarxiv icon

Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling

Add code
Oct 15, 2024
Figure 1 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 2 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 3 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Figure 4 for Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling
Viaarxiv icon

TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs

Add code
Oct 14, 2024
Figure 1 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 2 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 3 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Figure 4 for TMGBench: A Systematic Game Benchmark for Evaluating Strategic Reasoning Abilities of LLMs
Viaarxiv icon

VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment

Add code
Oct 12, 2024
Figure 1 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 2 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 3 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Figure 4 for VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Viaarxiv icon

Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models

Add code
Oct 10, 2024
Figure 1 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 2 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 3 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Figure 4 for Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
Viaarxiv icon