Picture for Taiki Miyanishi

Taiki Miyanishi

Stitch4D: Sparse Multi-Location 4D Urban Reconstruction via Spatio-Temporal Interpolation

Add code
Apr 09, 2026
Viaarxiv icon

EC-Bench: Enumeration and Counting Benchmark for Ultra-Long Videos

Add code
Mar 31, 2026
Viaarxiv icon

PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models

Add code
Mar 17, 2026
Viaarxiv icon

LegalViz: Legal Text Visualization by Text To Diagram Generation

Add code
Feb 10, 2025
Viaarxiv icon

Answerability Fields: Answerable Location Estimation via Diffusion Models

Add code
Jul 26, 2024
Figure 1 for Answerability Fields: Answerable Location Estimation via Diffusion Models
Figure 2 for Answerability Fields: Answerable Location Estimation via Diffusion Models
Figure 3 for Answerability Fields: Answerable Location Estimation via Diffusion Models
Figure 4 for Answerability Fields: Answerable Location Estimation via Diffusion Models
Viaarxiv icon

CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information

Add code
Jun 20, 2024
Figure 1 for CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Figure 2 for CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Figure 3 for CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Figure 4 for CityNav: Language-Goal Aerial Navigation Dataset with Geographic Information
Viaarxiv icon

Map-based Modular Approach for Zero-shot Embodied Question Answering

Add code
May 26, 2024
Viaarxiv icon

JDocQA: Japanese Document Question Answering Dataset for Generative Language Models

Add code
Mar 28, 2024
Figure 1 for JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Figure 2 for JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Figure 3 for JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Figure 4 for JDocQA: Japanese Document Question Answering Dataset for Generative Language Models
Viaarxiv icon

Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction

Add code
Feb 28, 2024
Figure 1 for Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Figure 2 for Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Figure 3 for Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Figure 4 for Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction
Viaarxiv icon

CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data

Add code
Oct 28, 2023
Viaarxiv icon