Gurusha Juneja

EnactToM: An Evolving Benchmark for Functional Theory of Mind in Embodied Agents

May 11, 2026
Via arXiv

Policies Permitting LLM Use for Polishing Peer Reviews Are Currently Not Enforceable

Mar 20, 2026
Via arXiv

MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation

Jun 25, 2025
Via arXiv

Task Facet Learning: A Structured Approach to Prompt Optimization

Jun 15, 2024
Via arXiv

$\texttt{LM}^\texttt{2}$: A Simple Society of Language Models Solves Complex Reasoning

Apr 02, 2024
Via arXiv

Prompt-Propose-Verify: A Reliable Hand-Object-Interaction Data Generation Framework using Foundational Models

Dec 23, 2023
Via arXiv

Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning

Oct 21, 2023
Via arXiv