Frederic Sala

R&B: Domain Regrouping and Data Mixture Balancing for Efficient Foundation Model Training

May 01, 2025

COSMOS: Predictable and Cost-Effective Adaptation of LLMs

Apr 30, 2025

TARDIS: Mitigating Temporal Misalignment via Representation Steering

Mar 25, 2025

Personalize Your LLM: Fake it then Align it

Mar 05, 2025

Tabby: Tabular Data Synthesis with Language Models

Mar 04, 2025

Theoretical Physics Benchmark (TPBench) -- a Dataset and Study of AI Reasoning Capabilities in Theoretical Physics

Feb 19, 2025

ScriptoriumWS: A Code Generation Assistant for Weak Supervision

Feb 17, 2025

Stronger Than You Think: Benchmarking Weak Supervision on Realistic Tasks

Jan 13, 2025

Evaluating Sample Utility for Data Selection by Mimicking Model Weights

Jan 12, 2025

Weak-to-Strong Generalization Through the Data-Centric Lens

Dec 05, 2024