Picture for Jianfeng Gao

Jianfeng Gao

EJ

SAS: Simulated Attention Score

Add code
Jul 10, 2025
Viaarxiv icon

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Add code
Jul 09, 2025
Viaarxiv icon

Training Language Models to Generate Quality Code with Program Analysis Feedback

Add code
May 28, 2025
Viaarxiv icon

EfficientLLM: Efficiency in Large Language Models

Add code
May 20, 2025
Viaarxiv icon

Text Generation Beyond Discrete Token Sampling

Add code
May 20, 2025
Viaarxiv icon

SITE: towards Spatial Intelligence Thorough Evaluation

Add code
May 08, 2025
Viaarxiv icon

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Add code
Apr 30, 2025
Viaarxiv icon

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Add code
Apr 29, 2025
Viaarxiv icon

MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

Add code
Apr 22, 2025
Viaarxiv icon

TRA: Better Length Generalisation with Threshold Relative Attention

Add code
Apr 02, 2025
Viaarxiv icon