Picture for Maximilian Mozes

Maximilian Mozes

Reverse Engineering Human Preferences with Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon

Command A: An Enterprise-Ready Large Language Model

Add code
Apr 01, 2025
Viaarxiv icon

LLMs can implicitly learn from mistakes in-context

Add code
Feb 12, 2025
Viaarxiv icon

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models

Add code
Nov 19, 2024
Figure 1 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 2 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 3 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Figure 4 for Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
Viaarxiv icon

Here's a Free Lunch: Sanitizing Backdoored Models with Model Merge

Add code
Feb 29, 2024
Viaarxiv icon

Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities

Add code
Aug 24, 2023
Viaarxiv icon

Challenges and Applications of Large Language Models

Add code
Jul 19, 2023
Viaarxiv icon

Susceptibility to Influence of Large Language Models

Add code
Mar 10, 2023
Viaarxiv icon

Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning

Add code
Feb 13, 2023
Viaarxiv icon

Towards Agile Text Classifiers for Everyone

Add code
Feb 13, 2023
Viaarxiv icon