Anh Tuan Luu

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Jan 15, 2026
Via arXiv

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Jan 13, 2026
Via arXiv

MRMR: A Realistic and Expert-Level Multidisciplinary Benchmark for Reasoning-Intensive Multimodal Retrieval

Oct 10, 2025
Via arXiv

Rethinking Reasoning: A Survey on Reasoning-based Backdoors in LLMs

Oct 09, 2025
Via arXiv

P2P: A Poison-to-Poison Remedy for Reliable Backdoor Defense in LLMs

Oct 06, 2025
Via arXiv

Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning

Jun 10, 2025
Via arXiv

ClozeMath: Improving Mathematical Reasoning in Language Models by Learning to Fill Equations

Jun 04, 2025
Via arXiv

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

May 21, 2025
Via arXiv

SCOPE: Compress Mathematical Reasoning Steps for Efficient Automated Process Annotation

May 20, 2025
Via arXiv

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Apr 22, 2025
Via arXiv