Picture for Ernie Chang

Ernie Chang

Shammie

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice

Add code
Jan 08, 2026
Viaarxiv icon

MobileLLM-Pro Technical Report

Add code
Nov 10, 2025
Viaarxiv icon

Self-Vocabularizing Training for Neural Machine Translation

Add code
Mar 19, 2025
Figure 1 for Self-Vocabularizing Training for Neural Machine Translation
Figure 2 for Self-Vocabularizing Training for Neural Machine Translation
Figure 3 for Self-Vocabularizing Training for Neural Machine Translation
Figure 4 for Self-Vocabularizing Training for Neural Machine Translation
Viaarxiv icon

Agent-as-a-Judge: Evaluate Agents with Agents

Add code
Oct 14, 2024
Figure 1 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 2 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 3 for Agent-as-a-Judge: Evaluate Agents with Agents
Figure 4 for Agent-as-a-Judge: Evaluate Agents with Agents
Viaarxiv icon

Scaling Parameter-Constrained Language Models with Quality Data

Add code
Oct 04, 2024
Figure 1 for Scaling Parameter-Constrained Language Models with Quality Data
Figure 2 for Scaling Parameter-Constrained Language Models with Quality Data
Figure 3 for Scaling Parameter-Constrained Language Models with Quality Data
Figure 4 for Scaling Parameter-Constrained Language Models with Quality Data
Viaarxiv icon

High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

Add code
Jul 04, 2024
Figure 1 for High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching
Figure 2 for High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching
Figure 3 for High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching
Figure 4 for High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching
Viaarxiv icon

Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications

Add code
May 24, 2024
Figure 1 for Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Figure 2 for Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Figure 3 for Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Figure 4 for Basis Selection: Low-Rank Decomposition of Pretrained Large Language Models for Target Applications
Viaarxiv icon

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Add code
Feb 22, 2024
Viaarxiv icon

Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition

Add code
Feb 20, 2024
Figure 1 for Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition
Figure 2 for Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition
Figure 3 for Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition
Figure 4 for Not All Weights Are Created Equal: Enhancing Energy Efficiency in On-Device Streaming Speech Recognition
Viaarxiv icon