Picture for Brando Miranda

Brando Miranda

The Easy, the Hard, and the Learnable: Confidence and Difficulty-Adaptive Policy Optimization for LLM Reasoning

Add code
Jun 06, 2026
Viaarxiv icon

Quantifying the Effect of Test Set Contamination on Generative Evaluations

Add code
Jan 07, 2026
Viaarxiv icon

Lean-ing on Quality: How High-Quality Data Beats Diverse Multilingual Data in AutoFormalization

Add code
Feb 18, 2025
Viaarxiv icon

Exploring the Efficacy of Meta-Learning: Unveiling Superior Data Diversity Utilization of MAML Over Pre-training

Add code
Jan 15, 2025
Viaarxiv icon

Quantifying the Importance of Data Alignment in Downstream Model Performance

Add code
Jan 14, 2025
Figure 1 for Quantifying the Importance of Data Alignment in Downstream Model Performance
Figure 2 for Quantifying the Importance of Data Alignment in Downstream Model Performance
Figure 3 for Quantifying the Importance of Data Alignment in Downstream Model Performance
Figure 4 for Quantifying the Importance of Data Alignment in Downstream Model Performance
Viaarxiv icon

ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment

Add code
Oct 23, 2024
Figure 1 for ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Figure 2 for ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Figure 3 for ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Figure 4 for ZIP-FIT: Embedding-Free Data Selection via Compression-Based Alignment
Viaarxiv icon

Pantograph: A Machine-to-Machine Interaction Interface for Advanced Theorem Proving, High Level Reasoning, and Data Extraction in Lean 4

Add code
Oct 21, 2024
Viaarxiv icon

When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?

Add code
Jul 21, 2024
Figure 1 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 2 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 3 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Figure 4 for When Do Universal Image Jailbreaks Transfer Between Vision-Language Models?
Viaarxiv icon

Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?

Add code
Jun 06, 2024
Figure 1 for Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Figure 2 for Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Figure 3 for Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Figure 4 for Why Has Predicting Downstream Capabilities of Frontier AI Models with Scale Remained Elusive?
Viaarxiv icon

An Evaluation Benchmark for Autoformalization in Lean4

Add code
Jun 01, 2024
Figure 1 for An Evaluation Benchmark for Autoformalization in Lean4
Figure 2 for An Evaluation Benchmark for Autoformalization in Lean4
Viaarxiv icon