Shixuan Liu

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security

Jul 29, 2025

Group Sequence Policy Optimization

Jul 24, 2025

Stable Reinforcement Learning for Efficient Reasoning

May 23, 2025

Qwen3 Technical Report

May 14, 2025

From Captions to Rewards (CAREVL): Leveraging Large Language Model Experts for Enhanced Reward Modeling in Large Vision-Language Models

Mar 08, 2025

A Unified Modeling Framework for Automated Penetration Testing

Feb 17, 2025

Graph-attention-based Casual Discovery with Trust Region-navigated Clipping Policy Optimization

Dec 27, 2024

Map2Text: New Content Generation from Low-Dimensional Visualizations

Dec 24, 2024

$\texttt{dattri}$: A Library for Efficient Data Attribution

Oct 06, 2024

TeleChat Technical Report

Jan 08, 2024