Picture for Rita Cucchiara

Rita Cucchiara

Tiny Inference-Time Scaling with Latent Verifiers

Add code
Mar 25, 2026
Viaarxiv icon

Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off

Add code
Mar 23, 2026
Viaarxiv icon

BFS-PO: Best-First Search for Large Reasoning Models

Add code
Feb 16, 2026
Viaarxiv icon

Shifting the Breaking Point of Flow Matching for Multi-Instance Editing

Add code
Feb 09, 2026
Viaarxiv icon

Restoring Exploration after Post-Training: Latent Exploration Decoding for Large Reasoning Models

Add code
Feb 02, 2026
Viaarxiv icon

A Unified Masked Jigsaw Puzzle Framework for Vision and Language Models

Add code
Jan 17, 2026
Viaarxiv icon

CounterVid: Counterfactual Video Generation for Mitigating Action and Temporal Hallucinations in Video-Language Models

Add code
Jan 08, 2026
Viaarxiv icon

Seeing Beyond Words: Self-Supervised Visual Learning for Multimodal Large Language Models

Add code
Dec 17, 2025
Viaarxiv icon

Recurrence Meets Transformers for Universal Multimodal Retrieval

Add code
Sep 10, 2025
Figure 1 for Recurrence Meets Transformers for Universal Multimodal Retrieval
Figure 2 for Recurrence Meets Transformers for Universal Multimodal Retrieval
Figure 3 for Recurrence Meets Transformers for Universal Multimodal Retrieval
Figure 4 for Recurrence Meets Transformers for Universal Multimodal Retrieval
Viaarxiv icon

Dual Orthogonal Guidance for Robust Diffusion-based Handwritten Text Generation

Add code
Aug 23, 2025
Viaarxiv icon