Liliang Ren

SAS: Simulated Attention Score

Jul 10, 2025 (via arXiv)

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Jul 09, 2025 (via arXiv)

PaTH Attention: Position Encoding via Accumulating Householder Transformations

May 22, 2025 (via arXiv)

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Apr 30, 2025 (via arXiv)

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Apr 29, 2025 (via arXiv)

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025 (via arXiv)

BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues

Jan 18, 2025 (via arXiv)

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Jun 11, 2024 (via arXiv)

Sparse Modular Activation for Efficient Sequence Modeling

Jul 11, 2023 (via arXiv)

C-PMI: Conditional Pointwise Mutual Information for Turn-level Dialogue Evaluation

Jun 27, 2023 (via arXiv)