Picture for David Acuna

David Acuna

Long Grounded Thoughts: Distilling Compositional Visual Reasoning Chains at Scale

Add code
Nov 07, 2025
Viaarxiv icon

Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions

Add code
Jun 10, 2025
Viaarxiv icon

Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning

Add code
May 26, 2025
Viaarxiv icon

LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception

Add code
Apr 21, 2025
Viaarxiv icon

Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning

Add code
Apr 06, 2025
Figure 1 for Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning
Figure 2 for Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning
Figure 3 for Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning
Figure 4 for Retro-Search: Exploring Untaken Paths for Deeper and Efficient Reasoning
Viaarxiv icon

Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models

Add code
Sep 15, 2024
Figure 1 for Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models
Figure 2 for Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models
Figure 3 for Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models
Figure 4 for Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models
Viaarxiv icon

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Add code
Aug 19, 2024
Figure 1 for Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Figure 2 for Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Figure 3 for Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Figure 4 for Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering
Viaarxiv icon

RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting

Add code
Apr 16, 2024
Figure 1 for RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
Figure 2 for RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
Figure 3 for RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
Figure 4 for RefFusion: Reference Adapted Diffusion Models for 3D Scene Inpainting
Viaarxiv icon

Can Feedback Enhance Semantic Grounding in Large Vision-Language Models?

Add code
Apr 09, 2024
Viaarxiv icon

DreamTeacher: Pretraining Image Backbones with Deep Generative Models

Add code
Jul 14, 2023
Figure 1 for DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Figure 2 for DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Figure 3 for DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Figure 4 for DreamTeacher: Pretraining Image Backbones with Deep Generative Models
Viaarxiv icon