Picture for Mo Yu

Mo Yu

ComoRAG: A Cognitive-Inspired Memory-Organized RAG for Stateful Long Narrative Reasoning

Add code
Aug 14, 2025
Viaarxiv icon

Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings

Add code
Jun 10, 2025
Viaarxiv icon

Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics

Add code
May 01, 2025
Figure 1 for Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Figure 2 for Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Figure 3 for Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Figure 4 for Pushing the Limits of Low-Bit Optimizers: A Focus on EMA Dynamics
Viaarxiv icon

FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset

Add code
Mar 11, 2025
Viaarxiv icon

DBudgetKV: Dynamic Budget in KV Cache Compression for Ensuring Optimal Performance

Add code
Feb 24, 2025
Viaarxiv icon

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Add code
Feb 13, 2025
Figure 1 for The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Figure 2 for The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Figure 3 for The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Figure 4 for The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding
Viaarxiv icon

Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task

Add code
Feb 11, 2025
Figure 1 for Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task
Figure 2 for Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task
Figure 3 for Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task
Figure 4 for Understanding LLMs' Fluid Intelligence Deficiency: An Analysis of the ARC Task
Viaarxiv icon

The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters

Add code
Jan 03, 2025
Figure 1 for The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters
Figure 2 for The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters
Figure 3 for The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters
Figure 4 for The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters
Viaarxiv icon

Large Language Models Can Self-Improve in Long-context Reasoning

Add code
Nov 12, 2024
Figure 1 for Large Language Models Can Self-Improve in Long-context Reasoning
Figure 2 for Large Language Models Can Self-Improve in Long-context Reasoning
Figure 3 for Large Language Models Can Self-Improve in Long-context Reasoning
Figure 4 for Large Language Models Can Self-Improve in Long-context Reasoning
Viaarxiv icon

On the token distance modeling ability of higher RoPE attention dimension

Add code
Oct 11, 2024
Figure 1 for On the token distance modeling ability of higher RoPE attention dimension
Figure 2 for On the token distance modeling ability of higher RoPE attention dimension
Figure 3 for On the token distance modeling ability of higher RoPE attention dimension
Figure 4 for On the token distance modeling ability of higher RoPE attention dimension
Viaarxiv icon