Picture for Georgia Gkioxari

Georgia Gkioxari

NitroGen: An Open Foundation Model for Generalist Gaming Agents

Add code
Jan 04, 2026
Viaarxiv icon

Same or Not? Enhancing Visual Perception in Vision-Language Models

Add code
Dec 29, 2025
Viaarxiv icon

Feedforward 3D Editing via Text-Steerable Image-to-3D

Add code
Dec 15, 2025
Viaarxiv icon

No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers

Add code
Dec 09, 2025
Viaarxiv icon

Is This Tracker On? A Benchmark Protocol for Dynamic Tracking

Add code
Oct 22, 2025
Viaarxiv icon

NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models

Add code
Jun 09, 2025
Viaarxiv icon

Aligning Text, Images, and 3D Structure Token-by-Token

Add code
Jun 09, 2025
Viaarxiv icon

MonoTher-Depth: Enhancing Thermal Depth Estimation via Confidence-Aware Distillation

Add code
Apr 21, 2025
Viaarxiv icon

Is CLIP ideal? No. Can we fix it? Yes!

Add code
Mar 10, 2025
Figure 1 for Is CLIP ideal? No. Can we fix it? Yes!
Figure 2 for Is CLIP ideal? No. Can we fix it? Yes!
Figure 3 for Is CLIP ideal? No. Can we fix it? Yes!
Figure 4 for Is CLIP ideal? No. Can we fix it? Yes!
Viaarxiv icon

Visual Agentic AI for Spatial Reasoning with a Dynamic API

Add code
Feb 10, 2025
Figure 1 for Visual Agentic AI for Spatial Reasoning with a Dynamic API
Figure 2 for Visual Agentic AI for Spatial Reasoning with a Dynamic API
Figure 3 for Visual Agentic AI for Spatial Reasoning with a Dynamic API
Figure 4 for Visual Agentic AI for Spatial Reasoning with a Dynamic API
Viaarxiv icon