Christof Monz

Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training

Oct 06, 2025
Via arXiv

Best-of-L: Cross-Lingual Reward Modeling for Mathematical Reasoning

Sep 19, 2025
Via arXiv

Please Translate Again: Two Simple Experiments on Whether Human-Like Reasoning Helps Translation

Jun 05, 2025
Via arXiv

Fractured Chain-of-Thought Reasoning

May 19, 2025
Via arXiv

The Effect of Language Diversity When Fine-Tuning Large Language Models for Translation

May 19, 2025
Via arXiv

What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs

May 15, 2025
Via arXiv

Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation

May 09, 2025
Via arXiv

Calibrating Translation Decoding with Quality Estimation on LLMs

Apr 26, 2025
Via arXiv

Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling

Apr 18, 2025
Via arXiv

ClusComp: A Simple Paradigm for Model Compression and Efficient Finetuning

Mar 17, 2025
Via arXiv