Picture for Giuseppe Carenini

Giuseppe Carenini

University of British Columbia

MCompassRAG: Topic Metadata as a Semantic Compass for Paragraph-Level Retrieval

Add code
Jun 16, 2026
Viaarxiv icon

SproutRAG: Attention-Guided Tree Search with Progressive Embeddings for Long-Document RAG

Add code
Jun 16, 2026
Viaarxiv icon

The Illusion of Multi-Agent Advantage

Add code
Jun 11, 2026
Viaarxiv icon

UnpredictaBench: A Benchmark for Evaluating Distributional Randomness in LLMs

Add code
Jun 04, 2026
Viaarxiv icon

VAMPS: Visual-Assisted Mathematical Problem Solving Benchmark

Add code
Jun 02, 2026
Viaarxiv icon

When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound

Add code
Mar 22, 2026
Viaarxiv icon

BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models

Add code
Nov 17, 2025
Figure 1 for BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
Figure 2 for BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
Figure 3 for BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
Figure 4 for BeDiscovER: The Benchmark of Discourse Understanding in the Era of Reasoning Language Models
Viaarxiv icon

ChartGaze: Enhancing Chart Understanding in LVLMs with Eye-Tracking Guided Attention Refinement

Add code
Sep 16, 2025
Viaarxiv icon

SMARTAPS: Tool-augmented LLMs for Operations Management

Add code
Jul 23, 2025
Viaarxiv icon

SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents

Add code
Jun 09, 2025
Figure 1 for SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents
Figure 2 for SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents
Figure 3 for SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents
Figure 4 for SOP-Bench: Complex Industrial SOPs for Evaluating LLM Agents
Viaarxiv icon