Tan M. Nguyen

Revisiting Transformers with Insights from Image Filtering

Jun 12, 2025
Via arXiv

Tree-Sliced Wasserstein Distance with Nonlinear Projection

May 02, 2025
Via arXiv

Distance-Based Tree-Sliced Wasserstein Distance

Mar 14, 2025
Via arXiv

Spherical Tree-Sliced Wasserstein Distance

Mar 14, 2025
Via arXiv

MoLEx: Mixture of Layer Experts for Finetuning with Sparse Upcycling

Mar 14, 2025
Via arXiv

CAMEx: Curvature-aware Merging of Experts

Feb 26, 2025
Via arXiv

Tight Clusters Make Specialized Experts

Feb 21, 2025
Via arXiv

An Attention-based Framework for Fair Contrastive Learning

Nov 22, 2024
Via arXiv

MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts

Oct 18, 2024
Via arXiv

A Primal-Dual Framework for Transformers and Neural Networks

Jun 19, 2024
Via arXiv