Picture for Ben Athiwaratkun

Ben Athiwaratkun

Staircase Streaming for Low-Latency Multi-Agent Inference

Add code
Oct 06, 2025
Figure 1 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 2 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 3 for Staircase Streaming for Low-Latency Multi-Agent Inference
Figure 4 for Staircase Streaming for Low-Latency Multi-Agent Inference
Viaarxiv icon

Data Diversification Methods In Alignment Enhance Math Performance In LLMs

Add code
Jul 02, 2025
Figure 1 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 2 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 3 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 4 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Viaarxiv icon

Disentangling Reasoning and Knowledge in Medical Large Language Models

Add code
May 16, 2025
Figure 1 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 2 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 3 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 4 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Viaarxiv icon

Improving Model Alignment Through Collective Intelligence of Open-Source LLMS

Add code
May 05, 2025
Figure 1 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 2 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 3 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Figure 4 for Improving Model Alignment Through Collective Intelligence of Open-Source LLMS
Viaarxiv icon

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?

Add code
Apr 19, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon

Scaling Instruction-Tuned LLMs to Million-Token Contexts via Hierarchical Synthetic Data Generation

Add code
Apr 17, 2025
Viaarxiv icon

Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping

Add code
Jan 11, 2025
Figure 1 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 2 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 3 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Figure 4 for Ladder-residual: parallelism-aware architecture for accelerating large model inference with communication overlapping
Viaarxiv icon

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Training-Free Activation Sparsity in Large Language Models

Add code
Aug 26, 2024
Viaarxiv icon