Picture for Chak Tou Leong

Chak Tou Leong

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution

Add code
May 27, 2025
Viaarxiv icon

Scaling over Scaling: Exploring Test-Time Scaling Pareto in Large Reasoning Models

Add code
May 26, 2025
Viaarxiv icon

KNN-SSD: Enabling Dynamic Self-Speculative Decoding via Nearest Neighbor Layer Set Optimization

Add code
May 22, 2025
Viaarxiv icon

Symbolic Representation for Any-to-Any Generative Tasks

Add code
Apr 24, 2025
Viaarxiv icon

Video-Bench: Human-Aligned Video Generation Benchmark

Add code
Apr 07, 2025
Viaarxiv icon

STeCa: Step-level Trajectory Calibration for LLM Agent Learning

Add code
Feb 20, 2025
Viaarxiv icon

Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region

Add code
Feb 19, 2025
Viaarxiv icon

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Add code
Feb 17, 2025
Viaarxiv icon

Imitate Before Detect: Aligning Machine Stylistic Preference for Machine-Revised Text Detection

Add code
Dec 22, 2024
Viaarxiv icon

Direct Preference Optimization Using Sparse Feature-Level Constraints

Add code
Nov 12, 2024
Figure 1 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 2 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 3 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Figure 4 for Direct Preference Optimization Using Sparse Feature-Level Constraints
Viaarxiv icon