Yuandong Tian

Composing Global Optimizers to Reasoning Tasks via Algebraic Objects in Neural Nets

Oct 02, 2024
Via arXiv

The Perfect Blend: Redefining RLHF with Mixture of Judges

Sep 30, 2024
Via arXiv

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Jul 28, 2024
Via arXiv

From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients

Jul 15, 2024
Via arXiv

Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients

Jul 11, 2024
Via arXiv

LoCoCo: Dropping In Convolutions for Long Context Compression

Jun 08, 2024
Via arXiv

SpinQuant: LLM quantization with learned rotations

May 28, 2024
Via arXiv

AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs

Apr 21, 2024
Via arXiv

TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Apr 18, 2024
Via arXiv

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Mar 06, 2024
Via arXiv