Picture for Dong Yu

Dong Yu

Teaching LLMs to Refine with Tools

Add code
Dec 22, 2024
Figure 1 for Teaching LLMs to Refine with Tools
Figure 2 for Teaching LLMs to Refine with Tools
Figure 3 for Teaching LLMs to Refine with Tools
Figure 4 for Teaching LLMs to Refine with Tools
Viaarxiv icon

Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models

Add code
Dec 21, 2024
Figure 1 for Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
Figure 2 for Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
Figure 3 for Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
Figure 4 for Attention Entropy is a Key Factor: An Analysis of Parallel Context Encoding with Full-attention-based Pre-trained Language Models
Viaarxiv icon

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Add code
Nov 26, 2024
Figure 1 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 2 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 3 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Figure 4 for Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens
Viaarxiv icon

Federated Incremental Named Entity Recognition

Add code
Nov 18, 2024
Figure 1 for Federated Incremental Named Entity Recognition
Figure 2 for Federated Incremental Named Entity Recognition
Figure 3 for Federated Incremental Named Entity Recognition
Figure 4 for Federated Incremental Named Entity Recognition
Viaarxiv icon

Evaluating Moral Beliefs across LLMs through a Pluralistic Framework

Add code
Nov 06, 2024
Viaarxiv icon

OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization

Add code
Oct 25, 2024
Figure 1 for OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Figure 2 for OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Figure 3 for OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Figure 4 for OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
Viaarxiv icon

LoGU: Long-form Generation with Uncertainty Expressions

Add code
Oct 18, 2024
Figure 1 for LoGU: Long-form Generation with Uncertainty Expressions
Figure 2 for LoGU: Long-form Generation with Uncertainty Expressions
Figure 3 for LoGU: Long-form Generation with Uncertainty Expressions
Figure 4 for LoGU: Long-form Generation with Uncertainty Expressions
Viaarxiv icon

Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers

Add code
Oct 17, 2024
Viaarxiv icon

Atomic Calibration of LLMs in Long-Form Generations

Add code
Oct 17, 2024
Figure 1 for Atomic Calibration of LLMs in Long-Form Generations
Figure 2 for Atomic Calibration of LLMs in Long-Form Generations
Figure 3 for Atomic Calibration of LLMs in Long-Form Generations
Figure 4 for Atomic Calibration of LLMs in Long-Form Generations
Viaarxiv icon

LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory

Add code
Oct 14, 2024
Figure 1 for LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Figure 2 for LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Figure 3 for LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Figure 4 for LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory
Viaarxiv icon