Rodrigo Nogueira

CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context

Mar 23, 2026

Sabiá-4 Technical Report

Mar 10, 2026

Curió-Edu 7B: Examining Data Selection Impacts in LLM Continued Pretraining

Dec 14, 2025

Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora

Sep 10, 2025

Comparing Knowledge Injection Methods for LLMs in a Low-Resource Regime

Aug 08, 2025

Automatic Legal Writing Evaluation of LLMs

Apr 29, 2025

TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models

Jan 13, 2025

The interplay between domain specialization and model size: a case study in the legal domain

Jan 03, 2025

Sabiá-3 Technical Report

Oct 15, 2024

MLissard: Multilingual Long and Simple Sequential Reasoning Benchmarks

Oct 08, 2024