Picture for Xiaodong Liu

Xiaodong Liu

GRIN: GRadient-INformed MoE

Add code
Sep 18, 2024
Viaarxiv icon

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering

Add code
Sep 16, 2024
Viaarxiv icon

Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

Add code
Aug 26, 2024
Figure 1 for Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Figure 2 for Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Figure 3 for Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Figure 4 for Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning
Viaarxiv icon

Diversifying the Expert Knowledge for Task-Agnostic Pruning in Sparse Mixture-of-Experts

Add code
Jul 12, 2024
Viaarxiv icon

DefSent+: Improving sentence embeddings of language models by projecting definition sentences into a quasi-isotropic or isotropic vector space of unlimited dictionary entries

Add code
May 25, 2024
Viaarxiv icon

SWEA: Changing Factual Knowledge in Large Language Models via Subject Word Embedding Altering

Add code
Jan 31, 2024
Viaarxiv icon

Towards Consistent Natural-Language Explanations via Explanation-Consistency Finetuning

Add code
Jan 25, 2024
Viaarxiv icon

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Add code
Jan 05, 2024
Viaarxiv icon

Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Add code
Nov 03, 2023
Viaarxiv icon

Automatic Hallucination Assessment for Aligned Large Language Models via Transferable Adversarial Attacks

Add code
Oct 19, 2023
Viaarxiv icon