Dacheng Tao

and Other Contributors

Revisiting Knowledge Distillation for Autoregressive Language Models

Feb 19, 2024

DB-LLM: Accurate Dual-Binarization for Efficient LLMs

Feb 19, 2024

ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding

Feb 19, 2024

Towards Theoretical Understandings of Self-Consuming Generative Models

Feb 19, 2024

Continual Learning on Graphs: Challenges, Solutions, and Opportunities

Feb 18, 2024

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

Feb 16, 2024

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

Feb 13, 2024

Large Language Models as an Indirect Reasoner: Contrapositive and Contradiction for Automated Reasoning

Feb 06, 2024

Poisson Process for Bayesian Optimization

Feb 05, 2024

Representation Surgery for Multi-Task Model Merging

Feb 05, 2024