Ruoxuan Zhang

PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos

Apr 10, 2026
Via arXiv

Aligning Progress and Feasibility: A Neuro-Symbolic Dual Memory Framework for Long-Horizon LLM Agents

Apr 03, 2026
Via arXiv

RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation

Jun 07, 2025
Via arXiv

RecipeGen: A Benchmark for Real-World Recipe Image Generation

Mar 07, 2025
Via arXiv