Shyam Upadhyay

Shammie

Do LLMs Really Need 10+ Thoughts for "Find the Time 1000 Days Later"? Towards Structural Understanding of LLM Overthinking

Oct 09, 2025

Vibe Checker: Aligning Code Evaluation with Human Preference

Oct 08, 2025

Multi-Turn Puzzles: Evaluating Interactive Reasoning and Strategic Dialogue in LLMs

Aug 13, 2025

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

AutoMix: Automatically Mixing Language Models

Oct 19, 2023

How FaR Are Large Language Models From Agents with Theory-of-Mind?

Oct 04, 2023

Efficient Encoders for Streaming Sequence Tagging

Jan 23, 2023

CST5: Data Augmentation for Code-Switched Semantic Parsing

Nov 14, 2022

Streaming Intended Query Detection using E2E Modeling for Continued Conversation

Aug 29, 2022

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022