Picture for Sumeet Ramesh Motwani

Sumeet Ramesh Motwani

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning

Add code
Apr 15, 2026
Viaarxiv icon

h1: Bootstrapping LLMs to Reason over Longer Horizons via Reinforcement Learning

Add code
Oct 08, 2025
Viaarxiv icon

MALT: Improving Reasoning with Multi-Agent LLM Training

Add code
Dec 02, 2024
Figure 1 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 2 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 3 for MALT: Improving Reasoning with Multi-Agent LLM Training
Viaarxiv icon

Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits

Add code
Jun 03, 2024
Figure 1 for Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Figure 2 for Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Figure 3 for Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Figure 4 for Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
Viaarxiv icon

Secret Collusion Among Generative AI Agents

Add code
Feb 12, 2024
Figure 1 for Secret Collusion Among Generative AI Agents
Figure 2 for Secret Collusion Among Generative AI Agents
Figure 3 for Secret Collusion Among Generative AI Agents
Figure 4 for Secret Collusion Among Generative AI Agents
Viaarxiv icon

STARC: A General Framework For Quantifying Differences Between Reward Functions

Add code
Sep 26, 2023
Figure 1 for STARC: A General Framework For Quantifying Differences Between Reward Functions
Viaarxiv icon