Text


Rethinking the Text-Vision Reasoning Imbalance in MLLMs through the Lens of Training Recipes

Oct 26, 2025
Via arXiv

Semi-supervised Vertex Hunting, with Applications in Network and Text Analysis

Oct 26, 2025
Via arXiv

Multi-Modal Masked Autoencoders for Learning Image-Spectrum Associations for Galaxy Evolution and Cosmology

Oct 26, 2025
Via arXiv

MAGIC-Talk: Motion-aware Audio-Driven Talking Face Generation with Customizable Identity Control

Oct 26, 2025
Via arXiv

Semantic Surgery: Zero-Shot Concept Erasure in Diffusion Models

Oct 26, 2025
Via arXiv

Self-Calibrated Consistency can Fight Back for Adversarial Robustness in Vision-Language Models

Oct 26, 2025
Via arXiv

OFFSIDE: Benchmarking Unlearning Misinformation in Multimodal Large Language Models

Oct 26, 2025
Via arXiv

TWC-SLAM: Multi-Agent Cooperative SLAM with Text Semantics and WiFi Features Integration for Similar Indoor Environments

Oct 26, 2025
Via arXiv

SRSR: Enhancing Semantic Accuracy in Real-World Image Super-Resolution with Spatially Re-Focused Text-Conditioning

Oct 26, 2025
Via arXiv

A Comprehensive Dataset for Human vs. AI Generated Text Detection

Oct 26, 2025
Via arXiv