Picture for Weisen Jiang

Weisen Jiang

MetaMoE: Diversity-Aware Proxy Selection for Privacy-Preserving Mixture-of-Experts Unification

Add code
May 14, 2026
Viaarxiv icon

RxEval: A Prescription-Level Benchmark for Evaluating LLM Medication Recommendation

Add code
May 14, 2026
Viaarxiv icon

MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation

Add code
Oct 09, 2025
Figure 1 for MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
Figure 2 for MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
Figure 3 for MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
Figure 4 for MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
Viaarxiv icon

PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model

Add code
May 06, 2025
Figure 1 for PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model
Figure 2 for PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model
Figure 3 for PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model
Figure 4 for PARM: Multi-Objective Test-Time Alignment via Preference-Aware Autoregressive Reward Model
Viaarxiv icon

RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models

Add code
Sep 30, 2024
Figure 1 for RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Figure 2 for RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Figure 3 for RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Figure 4 for RouterDC: Query-Based Router by Dual Contrastive Learning for Assembling Large Language Models
Viaarxiv icon

Enhancing Sharpness-Aware Minimization by Learning Perturbation Radius

Add code
Aug 15, 2024
Viaarxiv icon

Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy

Add code
Jul 04, 2024
Figure 1 for Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy
Figure 2 for Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy
Figure 3 for Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy
Figure 4 for Scalable Learned Model Soup on a Single GPU: An Efficient Subspace Training Strategy
Viaarxiv icon

MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders

Add code
Jul 02, 2024
Viaarxiv icon

Rendering Graphs for Graph Reasoning in Multimodal Large Language Models

Add code
Feb 03, 2024
Viaarxiv icon

Large Language Models as Visual Cross-Domain Learners

Add code
Jan 06, 2024
Figure 1 for Large Language Models as Visual Cross-Domain Learners
Figure 2 for Large Language Models as Visual Cross-Domain Learners
Figure 3 for Large Language Models as Visual Cross-Domain Learners
Figure 4 for Large Language Models as Visual Cross-Domain Learners
Viaarxiv icon