Generative Refinement Networks for Visual Synthesis

Apr 14, 2026
Via arXiv

Representation geometry shapes task performance in vision-language modeling for CT enterography

Apr 14, 2026
Via arXiv

Sparse Contrastive Learning for Content-Based Cold Item Recommendation

Apr 14, 2026
Via arXiv

Modeling Co-Pilots for Text-to-Model Translation

Apr 14, 2026
Via arXiv

Round-Trip Translation Reveals What Frontier Multilingual Benchmarks Miss

Apr 14, 2026
Via arXiv

Robotic Manipulation is Vision-to-Geometry Mapping ($f(v) \rightarrow G$): Vision-Geometry Backbones over Language and Video Models

Apr 14, 2026
Via arXiv

VideoFlexTok: Flexible-Length Coarse-to-Fine Video Tokenization

Apr 14, 2026
Via arXiv

Teaching LLMs Human-Like Editing of Inappropriate Argumentation via Reinforcement Learning

Apr 14, 2026
Via arXiv

NaviRAG: Towards Active Knowledge Navigation for Retrieval-Augmented Generation

Apr 14, 2026
Via arXiv

MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games

Apr 14, 2026
Via arXiv