Trung Le

TokenRatio: Principled Token-Level Preference Optimization via Ratio Matching

May 14, 2026
Via arXiv

Spectral Flattening Is All Muon Needs: How Orthogonalization Controls Learning Rate and Convergence

May 13, 2026
Via arXiv

Selective Off-Policy Reference Tuning with Plan Guidance

May 13, 2026
Via arXiv

BSO: Safety Alignment Is Density Ratio Matching

May 12, 2026
Via arXiv

LLM-XTM: Enhancing Cross-Lingual Topic Models with Large Language Models

May 05, 2026
Via arXiv

Diverse Image Priors for Black-box Data-free Knowledge Distillation

Apr 28, 2026
Via arXiv

MIPIC: Matryoshka Representation Learning via Self-Distilled Intra-Relational and Progressive Information Chaining

Apr 27, 2026
Via arXiv

Test-Time Instance-Specific Parameter Composition: A New Paradigm for Adaptive Generative Modeling

Mar 29, 2026
Via arXiv

Sharpness-Aware Minimization in Logit Space Efficiently Enhances Direct Preference Optimization

Mar 18, 2026
Via arXiv

Antibody: Strengthening Defense Against Harmful Fine-Tuning for Large Language Models via Attenuating Harmful Gradient Influence

Feb 28, 2026
Via arXiv