Image


OmniScience: A Large-scale Multi-modal Dataset for Scientific Image Understanding

Add code
Feb 14, 2026
Viaarxiv icon

RMPL: Relation-aware Multi-task Progressive Learning with Stage-wise Training for Multimedia Event Extraction

Add code
Feb 14, 2026
Viaarxiv icon

OneLatent: Single-Token Compression for Visual Latent Reasoning

Add code
Feb 14, 2026
Viaarxiv icon

HybridFlow: A Two-Step Generative Policy for Robotic Manipulation

Add code
Feb 14, 2026
Viaarxiv icon

Fine-tuned Vision Language Model for Localization of Parasitic Eggs in Microscopic Images

Add code
Feb 14, 2026
Viaarxiv icon

A WDLoRA-Based Multimodal Generative Framework for Clinically Guided Corneal Confocal Microscopy Image Synthesis in Diabetic Neuropathy

Add code
Feb 14, 2026
Viaarxiv icon

Optimized Certainty Equivalent Risk-Controlling Prediction Sets

Add code
Feb 14, 2026
Viaarxiv icon

Differentiable Rule Induction from Raw Sequence Inputs

Add code
Feb 14, 2026
Viaarxiv icon

Privacy-Concealing Cooperative Perception for BEV Scene Segmentation

Add code
Feb 14, 2026
Viaarxiv icon

Fast Swap-Based Element Selection for Multiplication-Free Dimension Reduction

Add code
Feb 14, 2026
Viaarxiv icon