Picture for Chak Tou Leong

Chak Tou Leong

Evaluating Parameter Efficient Methods for RLVR

Add code
Dec 30, 2025
Viaarxiv icon

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

Add code
May 27, 2025
Figure 1 for SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
Figure 2 for SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
Figure 3 for SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
Figure 4 for SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
Viaarxiv icon

Scaling over Scaling: Exploring Test-Time Scaling Pareto in Large Reasoning Models

Add code
May 26, 2025
Viaarxiv icon

KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization

Add code
May 22, 2025
Viaarxiv icon

Symbolic Representation for Any-to-Any Generative Tasks

Add code
Apr 24, 2025
Figure 1 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 2 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 3 for Symbolic Representation for Any-to-Any Generative Tasks
Figure 4 for Symbolic Representation for Any-to-Any Generative Tasks
Viaarxiv icon

Video-Bench: Human-Aligned Video Generation Benchmark

Add code
Apr 07, 2025
Figure 1 for Video-Bench: Human-Aligned Video Generation Benchmark
Figure 2 for Video-Bench: Human-Aligned Video Generation Benchmark
Figure 3 for Video-Bench: Human-Aligned Video Generation Benchmark
Figure 4 for Video-Bench: Human-Aligned Video Generation Benchmark
Viaarxiv icon

STeCa: Step-level Trajectory Calibration for LLM Agent Learning

Add code
Feb 20, 2025
Viaarxiv icon

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Add code
Feb 19, 2025
Figure 1 for Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region
Figure 2 for Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region
Figure 3 for Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region
Figure 4 for Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region
Viaarxiv icon

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Add code
Feb 17, 2025
Viaarxiv icon

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

Add code
Dec 22, 2024
Figure 1 for Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Figure 2 for Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Figure 3 for Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Figure 4 for Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection
Viaarxiv icon