Lingjie Chen

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Jul 24, 2025
Via arXiv

Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse Autoencoders

Oct 27, 2024
Via arXiv

"A good pun is its own reword": Can Large Language Models Understand Puns?

Apr 21, 2024
Via arXiv