Picture for David Wu

David Wu

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Add code
Apr 21, 2025
Viaarxiv icon

Aligning LLMs with Domain Invariant Reward Models

Add code
Jan 01, 2025
Viaarxiv icon

Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning

Add code
Oct 31, 2024
Figure 1 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 2 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 3 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Figure 4 for Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning
Viaarxiv icon

DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction

Add code
Sep 16, 2024
Figure 1 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 2 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 3 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Figure 4 for DILA: Dictionary Label Attention for Mechanistic Interpretability in High-dimensional Multi-label Medical Coding Prediction
Viaarxiv icon

The Virtues of Pessimism in Inverse Reinforcement Learning

Add code
Feb 08, 2024
Figure 1 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 2 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 3 for The Virtues of Pessimism in Inverse Reinforcement Learning
Figure 4 for The Virtues of Pessimism in Inverse Reinforcement Learning
Viaarxiv icon

Accelerating Inverse Reinforcement Learning with Expert Bootstrapping

Add code
Feb 04, 2024
Viaarxiv icon

The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT

Add code
Jul 05, 2023
Figure 1 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 2 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 3 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Figure 4 for The KiTS21 Challenge: Automatic segmentation of kidneys, renal tumors, and renal cysts in corticomedullary-phase CT
Viaarxiv icon

CryptOpt: Automatic Optimization of Straightline Code

Add code
May 31, 2023
Figure 1 for CryptOpt: Automatic Optimization of Straightline Code
Figure 2 for CryptOpt: Automatic Optimization of Straightline Code
Figure 3 for CryptOpt: Automatic Optimization of Straightline Code
Figure 4 for CryptOpt: Automatic Optimization of Straightline Code
Viaarxiv icon

Robust Risk-Aware Option Hedging

Add code
Apr 18, 2023
Viaarxiv icon

Improving Chess Commentaries by Combining Language Models with Symbolic Reasoning Engines

Add code
Dec 15, 2022
Viaarxiv icon