Picture for Jie Fu

Jie Fu

University of the Arts London, Creative Computing Institute, London, United Kingdom

The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution

Add code
Jan 21, 2026
Viaarxiv icon

Weights to Code: Extracting Interpretable Algorithms from the Discrete Transformer

Add code
Jan 09, 2026
Viaarxiv icon

Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective

Add code
Jun 16, 2025
Figure 1 for Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective
Figure 2 for Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective
Figure 3 for Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective
Figure 4 for Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective
Viaarxiv icon

Thompson Sampling in Online RLHF with General Function Approximation

Add code
May 29, 2025
Figure 1 for Thompson Sampling in Online RLHF with General Function Approximation
Viaarxiv icon

Thinker: Learning to Think Fast and Slow

Add code
May 27, 2025
Figure 1 for Thinker: Learning to Think Fast and Slow
Figure 2 for Thinker: Learning to Think Fast and Slow
Figure 3 for Thinker: Learning to Think Fast and Slow
Figure 4 for Thinker: Learning to Think Fast and Slow
Viaarxiv icon

Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons

Add code
May 23, 2025
Figure 1 for Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons
Figure 2 for Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons
Figure 3 for Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons
Figure 4 for Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons
Viaarxiv icon

NeuralGrok: Accelerate Grokking by Neural Gradient Transformation

Add code
Apr 24, 2025
Viaarxiv icon

Learning from Failures in Multi-Attempt Reinforcement Learning

Add code
Mar 04, 2025
Viaarxiv icon

Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Add code
Feb 27, 2025
Figure 1 for Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
Figure 2 for Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
Figure 3 for Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
Figure 4 for Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
Viaarxiv icon

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Add code
Feb 07, 2025
Figure 1 for Generating Symbolic World Models via Test-time Scaling of Large Language Models
Figure 2 for Generating Symbolic World Models via Test-time Scaling of Large Language Models
Figure 3 for Generating Symbolic World Models via Test-time Scaling of Large Language Models
Figure 4 for Generating Symbolic World Models via Test-time Scaling of Large Language Models
Viaarxiv icon