Picture for Ruoming Pang

Ruoming Pang

Reusing Pre-Training Data at Test Time is a Compute Multiplier

Add code
Nov 06, 2025
Figure 1 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 2 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 3 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Figure 4 for Reusing Pre-Training Data at Test Time is a Compute Multiplier
Viaarxiv icon

MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer

Add code
Sep 19, 2025
Figure 1 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 2 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 3 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Figure 4 for MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer
Viaarxiv icon

Synthetic bootstrapped pretraining

Add code
Sep 17, 2025
Viaarxiv icon

Can External Validation Tools Improve Annotation Quality for LLM-as-a-Judge?

Add code
Jul 22, 2025
Viaarxiv icon

RATTENTION: Towards the Minimal Sliding Window Size in Local-Global Attention Models

Add code
Jun 18, 2025
Viaarxiv icon

Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations

Add code
Apr 10, 2025
Viaarxiv icon

Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics

Add code
Mar 03, 2025
Figure 1 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 2 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 3 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Figure 4 for Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics
Viaarxiv icon

Instruction-Following Pruning for Large Language Models

Add code
Jan 07, 2025
Figure 1 for Instruction-Following Pruning for Large Language Models
Figure 2 for Instruction-Following Pruning for Large Language Models
Figure 3 for Instruction-Following Pruning for Large Language Models
Figure 4 for Instruction-Following Pruning for Large Language Models
Viaarxiv icon

Improve Vision Language Model Chain-of-thought Reasoning

Add code
Oct 21, 2024
Figure 1 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 2 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 3 for Improve Vision Language Model Chain-of-thought Reasoning
Figure 4 for Improve Vision Language Model Chain-of-thought Reasoning
Viaarxiv icon

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Add code
Oct 02, 2024
Figure 1 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 2 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 3 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Figure 4 for Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo
Viaarxiv icon