Picture for Yeyun Gong

Yeyun Gong

HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification

Add code
May 30, 2025
Viaarxiv icon

How does Alignment Enhance LLMs' Multilingual Capabilities? A Language Neurons Perspective

Add code
May 27, 2025
Viaarxiv icon

Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling

Add code
Mar 24, 2025
Viaarxiv icon

Process-based Self-Rewarding Language Models

Add code
Mar 05, 2025
Viaarxiv icon

DeepThink: Aligning Language Models with Domain-Specific User Intents

Add code
Feb 08, 2025
Viaarxiv icon

Optimizing Large Language Model Training Using FP4 Quantization

Add code
Jan 28, 2025
Figure 1 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 2 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 3 for Optimizing Large Language Model Training Using FP4 Quantization
Figure 4 for Optimizing Large Language Model Training Using FP4 Quantization
Viaarxiv icon

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Add code
Jan 23, 2025
Figure 1 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 2 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 3 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Figure 4 for Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models
Viaarxiv icon

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Add code
Dec 20, 2024
Figure 1 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 2 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 3 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Figure 4 for Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
Viaarxiv icon

From Intention To Implementation: Automating Biomedical Research via LLMs

Add code
Dec 12, 2024
Viaarxiv icon

Generative Context Distillation

Add code
Nov 24, 2024
Viaarxiv icon