Picture for Ameya Prabhu

Ameya Prabhu

Michael Pokorny

Intrinsic Credit Assignment for Long Horizon Interaction

Add code
Feb 12, 2026
Viaarxiv icon

Scaling Open-Ended Reasoning to Predict the Future

Add code
Dec 31, 2025
Viaarxiv icon

Un-Attributability: Computing Novelty From Retrieval & Semantic Similarity

Add code
Oct 31, 2025
Viaarxiv icon

Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols

Add code
Oct 10, 2025
Figure 1 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 2 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 3 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Figure 4 for Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols
Viaarxiv icon

VGGSounder: Audio-Visual Evaluations for Foundation Models

Add code
Aug 12, 2025
Viaarxiv icon

Answer Matching Outperforms Multiple Choice for Language Model Evaluation

Add code
Jul 03, 2025
Viaarxiv icon

A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility

Add code
Apr 09, 2025
Viaarxiv icon

Are We Done with Object-Centric Learning?

Add code
Apr 09, 2025
Figure 1 for Are We Done with Object-Centric Learning?
Figure 2 for Are We Done with Object-Centric Learning?
Figure 3 for Are We Done with Object-Centric Learning?
Figure 4 for Are We Done with Object-Centric Learning?
Viaarxiv icon

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Add code
Feb 26, 2025
Viaarxiv icon

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Add code
Feb 26, 2025
Figure 1 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 2 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 3 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Figure 4 for Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs
Viaarxiv icon