Text


DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Add code
Oct 14, 2025
Viaarxiv icon

UniFusion: Vision-Language Model as Unified Encoder in Image Generation

Add code
Oct 14, 2025
Figure 1 for UniFusion: Vision-Language Model as Unified Encoder in Image Generation
Figure 2 for UniFusion: Vision-Language Model as Unified Encoder in Image Generation
Figure 3 for UniFusion: Vision-Language Model as Unified Encoder in Image Generation
Figure 4 for UniFusion: Vision-Language Model as Unified Encoder in Image Generation
Viaarxiv icon

StyleDecipher: Robust and Explainable Detection of LLM-Generated Texts with Stylistic Analysis

Add code
Oct 14, 2025
Viaarxiv icon

ProtoSiTex: Learning Semi-Interpretable Prototypes for Multi-label Text Classification

Add code
Oct 14, 2025
Viaarxiv icon

SMEC: Rethinking Matryoshka Representation Learning for Retrieval Embedding Compression

Add code
Oct 14, 2025
Viaarxiv icon

MTOS: A LLM-Driven Multi-topic Opinion Simulation Framework for Exploring Echo Chamber Dynamics

Add code
Oct 14, 2025
Viaarxiv icon

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Add code
Oct 14, 2025
Viaarxiv icon

Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space

Add code
Oct 14, 2025
Viaarxiv icon

A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation

Add code
Oct 14, 2025
Viaarxiv icon

The Role of Parametric Injection-A Systematic Study of Parametric Retrieval-Augmented Generation

Add code
Oct 14, 2025
Viaarxiv icon