Pramod Viswanath

MEMO: Memory-Augmented Model Context Optimization for Robust Multi-Turn Multi-Agent LLM Games

Mar 09, 2026
Via arXiv

TabularMath: Evaluating Computational Extrapolation in Tabular Learning via Program-Verified Synthesis

Jan 25, 2026
Via arXiv

FrontierCS: Evolving Challenges for Evolving Intelligence

Dec 17, 2025
Via arXiv

Are Robust LLM Fingerprints Adversarially Robust?

Sep 30, 2025
Via arXiv

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

Jun 13, 2025
Via arXiv

CHANCERY: Evaluating Corporate Governance Reasoning Capabilities in Language Models

Jun 05, 2025
Via arXiv

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Mar 26, 2025
Via arXiv

AI Agents in Cryptoland: Practical Attacks and No Silver Bullet

Mar 20, 2025
Via arXiv

SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?

Mar 16, 2025
Via arXiv

Scalable Fingerprinting of Large Language Models

Feb 11, 2025
Via arXiv