Picture for Jiahao Huo

Jiahao Huo

EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next

Add code
Mar 12, 2026
Viaarxiv icon

Beyond the Grid: Layout-Informed Multi-Vector Retrieval with Parsed Visual Document Representations

Add code
Mar 02, 2026
Viaarxiv icon

Unlocking Multimodal Document Intelligence: From Current Triumphs to Future Frontiers of Visual Document Retrieval

Add code
Feb 23, 2026
Viaarxiv icon

Sculpting the Vector Space: Towards Efficient Multi-Vector Visual Document Retrieval via Prune-then-Merge Framework

Add code
Feb 23, 2026
Viaarxiv icon

CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding

Add code
Jan 29, 2026
Viaarxiv icon

Memory in the Age of AI Agents

Add code
Dec 15, 2025
Viaarxiv icon

Improving Wildlife Out-of-Distribution Detection: Africas Big Five

Add code
Jun 07, 2025
Viaarxiv icon

MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models

Add code
May 28, 2025
Figure 1 for MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models
Figure 2 for MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models
Figure 3 for MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models
Figure 4 for MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models
Viaarxiv icon

Pierce the Mists, Greet the Sky: Decipher Knowledge Overshadowing via Knowledge Circuit Analysis

Add code
May 21, 2025
Viaarxiv icon

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models

Add code
Feb 17, 2025
Viaarxiv icon