Keming Lu

Qwen2 Technical Report

Jul 16, 2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

Jun 30, 2024

The Reason behind Good or Bad: Towards a Better Mathematical Verifier with Natural Language Feedback

Jun 20, 2024

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Jun 19, 2024

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

Jun 04, 2024

Towards Scalable Automated Alignment of LLMs: A Survey

Jun 03, 2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

May 28, 2024

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

Jan 23, 2024

Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models

Nov 15, 2023

Speculative Contrastive Decoding

Nov 15, 2023