Picture for Songlin Hu

Songlin Hu

An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding

Add code
Mar 06, 2025
Figure 1 for An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
Figure 2 for An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
Figure 3 for An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
Figure 4 for An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
Viaarxiv icon

DSMoE: Matrix-Partitioned Experts with Dynamic Routing for Computation-Efficient Dense LLMs

Add code
Feb 18, 2025
Viaarxiv icon

Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media

Add code
Dec 04, 2024
Figure 1 for Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
Figure 2 for Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
Figure 3 for Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
Figure 4 for Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
Viaarxiv icon

The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models

Add code
Nov 18, 2024
Figure 1 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 2 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 3 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Figure 4 for The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
Viaarxiv icon

CartesianMoE: Boosting Knowledge Sharing among Experts via Cartesian Product Routing in Mixture-of-Experts

Add code
Oct 21, 2024
Viaarxiv icon

CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning

Add code
Oct 03, 2024
Figure 1 for CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Figure 2 for CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Figure 3 for CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Figure 4 for CodePMP: Scalable Preference Model Pretraining for Large Language Model Reasoning
Viaarxiv icon

Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction

Add code
Sep 25, 2024
Figure 1 for Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Figure 2 for Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Figure 3 for Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Figure 4 for Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
Viaarxiv icon

AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs

Add code
Sep 11, 2024
Figure 1 for AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs
Figure 2 for AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs
Figure 3 for AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs
Figure 4 for AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs
Viaarxiv icon

Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval

Add code
Aug 20, 2024
Figure 1 for Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Figure 2 for Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Figure 3 for Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Figure 4 for Task-level Distributionally Robust Optimization for Large Language Model-based Dense Retrieval
Viaarxiv icon

MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts

Add code
Jul 13, 2024
Viaarxiv icon