Picture for Tuo Zhao

Tuo Zhao

BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

Add code
Feb 16, 2024
Viaarxiv icon

Data Diversity Matters for Robust Instruction Tuning

Add code
Nov 21, 2023
Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Add code
Nov 03, 2023
Viaarxiv icon

Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms

Add code
Oct 30, 2023
Figure 1 for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Figure 2 for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Figure 3 for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Figure 4 for Model-Based Reparameterization Policy Gradient Methods: Theory and Practical Algorithms
Viaarxiv icon

Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

Add code
Oct 26, 2023
Figure 1 for Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
Figure 2 for Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
Figure 3 for Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
Figure 4 for Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult
Viaarxiv icon

Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification

Add code
Oct 25, 2023
Figure 1 for Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification
Figure 2 for Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification
Figure 3 for Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification
Figure 4 for Score Matching-based Pseudolikelihood Estimation of Neural Marked Spatio-Temporal Point Process with Uncertainty Quantification
Viaarxiv icon

SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process

Add code
Oct 25, 2023
Figure 1 for SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
Figure 2 for SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
Figure 3 for SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
Figure 4 for SMURF-THP: Score Matching-based UnceRtainty quantiFication for Transformer Hawkes Process
Viaarxiv icon

LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models

Add code
Oct 23, 2023
Figure 1 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 2 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 3 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Figure 4 for LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models
Viaarxiv icon

Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer

Add code
Oct 19, 2023
Figure 1 for Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Figure 2 for Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Figure 3 for Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Figure 4 for Efficient Long-Range Transformers: You Need to Attend More, but Not Necessarily at Every Layer
Viaarxiv icon

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms

Add code
Oct 16, 2023
Viaarxiv icon