Picture for Volker Tresp

Volker Tresp

Edit the Bits, Diff the Codes: Bitwise Residual Editing for Visual Autoregressive Models

Add code
Jun 11, 2026
Viaarxiv icon

TunerDiT: Training-free Progressive Steering of Diffusion Transformer for Multi-Event Video Generation

Add code
May 29, 2026
Viaarxiv icon

EchoRL: Reinforcement Learning via Rollout Echoing

Add code
May 29, 2026
Viaarxiv icon

Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

Add code
May 20, 2026
Viaarxiv icon

Routing-Free Mixture-of-Experts

Add code
Apr 01, 2026
Viaarxiv icon

Tool Verification for Test-Time Reinforcement Learning

Add code
Mar 02, 2026
Viaarxiv icon

WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents

Add code
Jan 29, 2026
Viaarxiv icon

AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models

Add code
Nov 14, 2025
Viaarxiv icon

Improving Perturbation-based Explanations by Understanding the Role of Uncertainty Calibration

Add code
Nov 13, 2025
Viaarxiv icon

Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning

Add code
Aug 27, 2025
Figure 1 for Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Figure 2 for Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Figure 3 for Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Figure 4 for Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
Viaarxiv icon