Picture for Yuanhuiyi Lyu

Yuanhuiyi Lyu

MLLMs are Deeply Affected by Modality Bias

Add code
May 24, 2025
Viaarxiv icon

Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning?

Add code
May 17, 2025
Viaarxiv icon

Reducing Unimodal Bias in Multi-Modal Semantic Segmentation with Multi-Scale Functional Entropy Regularization

Add code
May 10, 2025
Viaarxiv icon

DiMeR: Disentangled Mesh Reconstruction Model

Add code
Apr 24, 2025
Viaarxiv icon

OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation

Add code
Mar 10, 2025
Viaarxiv icon

From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers

Add code
Mar 10, 2025
Viaarxiv icon

MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation

Add code
Mar 09, 2025
Viaarxiv icon

RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning

Add code
Feb 02, 2025
Figure 1 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 2 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 3 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Figure 4 for RealRAG: Retrieval-augmented Realistic Image Generation via Self-reflective Contrastive Learning
Viaarxiv icon

MAGIC++: Efficient and Resilient Modality-Agnostic Semantic Segmentation via Hierarchical Modality Selection

Add code
Dec 22, 2024
Viaarxiv icon

A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges

Add code
Dec 16, 2024
Viaarxiv icon