Picture for Dawei Song

Dawei Song

ZigzagAttention: Efficient Long-Context Inference with Exclusive Retrieval and Streaming Heads

Add code
Aug 17, 2025
Viaarxiv icon

MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination

Add code
Jun 14, 2025
Viaarxiv icon

MoDification: Mixture of Depths Made Easy

Add code
Oct 18, 2024
Figure 1 for MoDification: Mixture of Depths Made Easy
Figure 2 for MoDification: Mixture of Depths Made Easy
Figure 3 for MoDification: Mixture of Depths Made Easy
Figure 4 for MoDification: Mixture of Depths Made Easy
Viaarxiv icon

Investigating Context Effects in Similarity Judgements in Large Language Models

Add code
Aug 20, 2024
Viaarxiv icon

Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check

Add code
Jun 04, 2024
Figure 1 for Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Figure 2 for Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Figure 3 for Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Figure 4 for Bi-DCSpell: A Bi-directional Detector-Corrector Interactive Framework for Chinese Spelling Check
Viaarxiv icon

A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition

Add code
May 12, 2024
Figure 1 for A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition
Figure 2 for A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition
Figure 3 for A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition
Figure 4 for A Supervised Information Enhanced Multi-Granularity Contrastive Learning Framework for EEG Based Emotion Recognition
Viaarxiv icon

Beyond the Speculative Game: A Survey of Speculative Execution in Large Language Models

Add code
Apr 23, 2024
Viaarxiv icon

LLM-Oriented Retrieval Tuner

Add code
Mar 04, 2024
Figure 1 for LLM-Oriented Retrieval Tuner
Figure 2 for LLM-Oriented Retrieval Tuner
Figure 3 for LLM-Oriented Retrieval Tuner
Figure 4 for LLM-Oriented Retrieval Tuner
Viaarxiv icon

Towards the Law of Capacity Gap in Distilling Language Models

Add code
Nov 13, 2023
Figure 1 for Towards the Law of Capacity Gap in Distilling Language Models
Figure 2 for Towards the Law of Capacity Gap in Distilling Language Models
Figure 3 for Towards the Law of Capacity Gap in Distilling Language Models
Figure 4 for Towards the Law of Capacity Gap in Distilling Language Models
Viaarxiv icon

On Elastic Language Models

Add code
Nov 13, 2023
Viaarxiv icon