Picture for Tom Goldstein

Tom Goldstein

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Add code
Jul 22, 2025
Viaarxiv icon

ARGUS: Hallucination and Omission Evaluation in Video-LLMs

Add code
Jun 09, 2025
Viaarxiv icon

Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes

Add code
Jun 09, 2025
Viaarxiv icon

A Fictional Q&A Dataset for Studying Memorization and Knowledge Acquisition

Add code
Jun 05, 2025
Viaarxiv icon

Quantifying Cross-Modality Memorization in Vision-Language Models

Add code
Jun 05, 2025
Viaarxiv icon

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Add code
Jun 05, 2025
Viaarxiv icon

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

Zero-Shot Vision Encoder Grafting via LLM Surrogates

Add code
May 28, 2025
Viaarxiv icon

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Add code
May 14, 2025
Viaarxiv icon

Leveraging AI for Productive and Trustworthy HPC Software: Challenges and Research Directions

Add code
May 13, 2025
Viaarxiv icon