Picture for Arman Cohan

Arman Cohan

Judging with Many Minds: Do More Perspectives Mean Less Prejudice?

Add code
May 26, 2025
Viaarxiv icon

Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective

Add code
May 21, 2025
Figure 1 for Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Figure 2 for Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Figure 3 for Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Figure 4 for Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective
Viaarxiv icon

Towards Artificial Intelligence Research Assistant for Expert-Involved Learning

Add code
May 03, 2025
Viaarxiv icon

IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery

Add code
Apr 23, 2025
Figure 1 for IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery
Figure 2 for IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery
Figure 3 for IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery
Figure 4 for IRIS: Interactive Research Ideation System for Accelerating Scientific Discovery
Viaarxiv icon

YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization

Add code
Apr 04, 2025
Figure 1 for YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization
Figure 2 for YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization
Figure 3 for YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization
Figure 4 for YaleNLP @ PerAnsSumm 2025: Multi-Perspective Integration via Mixture-of-Agents for Enhanced Healthcare QA Summarization
Viaarxiv icon

Z1: Efficient Test-time Scaling with Code

Add code
Apr 01, 2025
Figure 1 for Z1: Efficient Test-time Scaling with Code
Figure 2 for Z1: Efficient Test-time Scaling with Code
Figure 3 for Z1: Efficient Test-time Scaling with Code
Figure 4 for Z1: Efficient Test-time Scaling with Code
Viaarxiv icon

MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search

Add code
Mar 26, 2025
Figure 1 for MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
Figure 2 for MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
Figure 3 for MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
Figure 4 for MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search
Viaarxiv icon

Survey on Evaluation of LLM-based Agents

Add code
Mar 20, 2025
Viaarxiv icon

LocAgent: Graph-Guided LLM Agents for Code Localization

Add code
Mar 12, 2025
Figure 1 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 2 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 3 for LocAgent: Graph-Guided LLM Agents for Code Localization
Figure 4 for LocAgent: Graph-Guided LLM Agents for Code Localization
Viaarxiv icon

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Add code
Mar 10, 2025
Figure 1 for MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
Figure 2 for MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
Figure 3 for MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
Figure 4 for MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
Viaarxiv icon