Picture for Thales Sales Almeida

Thales Sales Almeida

Ticket-Bench: A Kickoff for Multilingual and Regionalized Agent Evaluation

Add code
Sep 17, 2025
Viaarxiv icon

Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora

Add code
Sep 10, 2025
Viaarxiv icon

BRoverbs -- Measuring how much LLMs understand Portuguese proverbs

Add code
Sep 10, 2025
Figure 1 for BRoverbs -- Measuring how much LLMs understand Portuguese proverbs
Figure 2 for BRoverbs -- Measuring how much LLMs understand Portuguese proverbs
Figure 3 for BRoverbs -- Measuring how much LLMs understand Portuguese proverbs
Figure 4 for BRoverbs -- Measuring how much LLMs understand Portuguese proverbs
Viaarxiv icon

TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models

Add code
Jan 13, 2025
Figure 1 for TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models
Figure 2 for TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models
Figure 3 for TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models
Figure 4 for TiEBe: A Benchmark for Assessing the Current Knowledge of Large Language Models
Viaarxiv icon

The interplay between domain specialization and model size: a case study in the legal domain

Add code
Jan 03, 2025
Viaarxiv icon

Sabiá-3 Technical Report

Add code
Oct 15, 2024
Viaarxiv icon

SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section

Add code
Aug 29, 2024
Figure 1 for SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section
Figure 2 for SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section
Figure 3 for SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section
Figure 4 for SurveySum: A Dataset for Summarizing Multiple Scientific Articles into a Survey Section
Viaarxiv icon

Measuring Cross-lingual Transfer in Bytes

Add code
Apr 12, 2024
Viaarxiv icon

Sabiá-2: A New Generation of Portuguese Large Language Models

Add code
Mar 26, 2024
Viaarxiv icon

Evaluating GPT-4's Vision Capabilities on Brazilian University Admission Exams

Add code
Nov 23, 2023
Viaarxiv icon