Jiayi Yuan

Henry

Give Me FP32 or Give Me Death? Challenges and Solutions for Reproducible Reasoning

Jun 11, 2025
Via arXiv

AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models

May 28, 2025
Via arXiv

Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

Mar 20, 2025
Via arXiv

Interpreting and Steering LLMs with Mutual Information-based Explanations on Sparse Autoencoders

Feb 21, 2025
Via arXiv

Robot Learning with Super-Linear Scaling

Dec 02, 2024
Via arXiv

InvestESG: A multi-agent reinforcement learning benchmark for studying climate investment as a social dilemma

Nov 15, 2024
Via arXiv

Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion

Oct 06, 2024
Via arXiv

DHP Benchmark: Are LLMs Good NLG Evaluators?

Aug 25, 2024
Via arXiv

KV Cache Compression, But What Must We Give in Return? A Comprehensive Benchmark of Long Context Capable Approaches

Jul 01, 2024
Via arXiv

Understanding Different Design Choices in Training Large Time Series Models

Jun 20, 2024
Via arXiv