Shaohui Liu

MiMo-V2-Flash Technical Report

Jan 08, 2026
Via arXiv

MiMo-Audio: Audio Language Models are Few-Shot Learners

Dec 29, 2025
Via arXiv

Benchmarking Egocentric Visual-Inertial SLAM at City Scale

Sep 30, 2025
Via arXiv

Segmenting and Understanding: Region-aware Semantic Attention for Fine-grained Image Quality Assessment with Large Language Models

Aug 11, 2025
Via arXiv

UGD-IML: A Unified Generative Diffusion-based Framework for Constrained and Unconstrained Image Manipulation Localization

Aug 08, 2025
Via arXiv

Q-CLIP: Unleashing the Power of Vision-Language Models for Video Quality Assessment through Unified Cross-Modal Adaptation

Aug 08, 2025
Via arXiv

VidText: Towards Comprehensive Evaluation for Video Text Understanding

May 28, 2025
Via arXiv

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

May 12, 2025
Via arXiv

MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment

Apr 22, 2025
Via arXiv

MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training

Feb 13, 2025
Via arXiv