Liliang Ren

SAS: Simulated Attention Score

Jul 10, 2025

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

Jul 09, 2025

PaTH Attention: Position Encoding via Accumulating Householder Transformations

May 22, 2025

Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math

Apr 30, 2025

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Apr 29, 2025

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025

BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues

Jan 18, 2025

Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling

Jun 11, 2024

Sparse Modular Activation for Efficient Sequence Modeling

Jul 11, 2023

C-PMI: Conditional Pointwise Mutual Information for Turn-level Dialogue Evaluation

Jun 27, 2023