Picture for Jing Nathan Yan

Jing Nathan Yan

Spend Your Rollouts Where It Counts: Rollout Allocation for Group-Based RL Post-Training

Add code
May 26, 2026
Viaarxiv icon

The Efficiency Gap in Byte Modeling

Add code
May 13, 2026
Viaarxiv icon

OverFill: Two-Stage Models for Efficient Language Model Decoding

Add code
Aug 11, 2025
Viaarxiv icon

Fairness Practices in Industry: A Case Study in Machine Learning Teams Building Recommender Systems

Add code
May 26, 2025
Viaarxiv icon

A Controlled Study on Long Context Extension and Generalization in LLMs

Add code
Sep 18, 2024
Figure 1 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 2 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 3 for A Controlled Study on Long Context Extension and Generalization in LLMs
Figure 4 for A Controlled Study on Long Context Extension and Generalization in LLMs
Viaarxiv icon

EG4D: Explicit Generation of 4D Object without Score Distillation

Add code
May 28, 2024
Viaarxiv icon

MambaByte: Token-free Selective State Space Model

Add code
Jan 24, 2024
Figure 1 for MambaByte: Token-free Selective State Space Model
Figure 2 for MambaByte: Token-free Selective State Space Model
Figure 3 for MambaByte: Token-free Selective State Space Model
Figure 4 for MambaByte: Token-free Selective State Space Model
Viaarxiv icon

Diffusion Models Without Attention

Add code
Nov 30, 2023
Figure 1 for Diffusion Models Without Attention
Figure 2 for Diffusion Models Without Attention
Figure 3 for Diffusion Models Without Attention
Figure 4 for Diffusion Models Without Attention
Viaarxiv icon

On What Basis? Predicting Text Preference Via Structured Comparative Reasoning

Add code
Nov 14, 2023
Figure 1 for On What Basis? Predicting Text Preference Via Structured Comparative Reasoning
Figure 2 for On What Basis? Predicting Text Preference Via Structured Comparative Reasoning
Figure 3 for On What Basis? Predicting Text Preference Via Structured Comparative Reasoning
Figure 4 for On What Basis? Predicting Text Preference Via Structured Comparative Reasoning
Viaarxiv icon

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning

Add code
Nov 13, 2023
Viaarxiv icon