Picture for Jie Fu

Jie Fu

University of the Arts London, Creative Computing Institute, London, United Kingdom

Thompson Sampling in Online RLHF with General Function Approximation

Add code
May 29, 2025
Viaarxiv icon

Thinker: Learning to Think Fast and Slow

Add code
May 27, 2025
Viaarxiv icon

Automata Learning of Preferences over Temporal Logic Formulas from Pairwise Comparisons

Add code
May 23, 2025
Viaarxiv icon

NeuralGrok: Accelerate Grokking by Neural Gradient Transformation

Add code
Apr 24, 2025
Viaarxiv icon

Learning from Failures in Multi-Attempt Reinforcement Learning

Add code
Mar 04, 2025
Viaarxiv icon

Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking

Add code
Feb 27, 2025
Viaarxiv icon

Generating Symbolic World Models via Test-time Scaling of Large Language Models

Add code
Feb 07, 2025
Viaarxiv icon

CBNN: 3-Party Secure Framework for Customized Binary Neural Networks Inference

Add code
Dec 21, 2024
Viaarxiv icon

Reactive Synthesis of Sensor Revealing Strategies in Hypergames on Graphs

Add code
Dec 02, 2024
Viaarxiv icon

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Add code
Nov 07, 2024
Figure 1 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 2 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 3 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Figure 4 for OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models
Viaarxiv icon