Federico Bianchi

Harnessing the Collective Intelligence of AI Agents in the Wild for New Discoveries

Jun 09, 2026

Automated Benchmark Auditing for AI Agents and Large Language Models

May 26, 2026

Evaluating Commercial AI Chatbots as News Intermediaries

May 21, 2026

"Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

Feb 16, 2026

Making Databases Faster with LLM Evolutionary Sampling

Feb 11, 2026

DSGym: A Holistic Framework for Evaluating and Training Data Science Agents

Jan 22, 2026

Learning to Discover at Test Time

Jan 22, 2026

Exploring the use of AI authors and reviewers at Agents4Science

Nov 19, 2025

Labeling Messages as AI-Generated Does Not Reduce Their Persuasive Effects

Apr 14, 2025

Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory

Apr 10, 2025