Picture for Roy Ka-Wei Lee

Roy Ka-Wei Lee

Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages

Add code
Sep 18, 2025
Figure 1 for Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages
Figure 2 for Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages
Figure 3 for Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages
Figure 4 for Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore's Low-Resource Languages
Viaarxiv icon

Persuasion Dynamics in LLMs: Investigating Robustness and Adaptability in Knowledge and Safety with DuET-PD

Add code
Aug 24, 2025
Viaarxiv icon

BGM-HAN: A Hierarchical Attention Network for Accurate and Fair Decision Assessment on Semi-Structured Profiles

Add code
Jul 23, 2025
Viaarxiv icon

Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation

Add code
Jul 16, 2025
Viaarxiv icon

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Add code
Jun 23, 2025
Viaarxiv icon

"Is This Really a Human Peer Supporter?": Misalignments Between Peer Supporters and Experts in LLM-Supported Interactions

Add code
Jun 11, 2025
Viaarxiv icon

Sword and Shield: Uses and Strategies of LLMs in Navigating Disinformation

Add code
Jun 08, 2025
Viaarxiv icon

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Add code
Jun 04, 2025
Figure 1 for SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Figure 2 for SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Figure 3 for SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Figure 4 for SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Viaarxiv icon

Understanding Refusal in Language Models with Sparse Autoencoders

Add code
May 29, 2025
Viaarxiv icon

Resolving Conflicting Evidence in Automated Fact-Checking: A Study on Retrieval-Augmented LLMs

Add code
May 23, 2025
Viaarxiv icon