Nakamasa Inoue

BioVITA: Biological Dataset, Model, and Benchmark for Visual-Textual-Acoustic Alignment

Mar 25, 2026
Via arXiv

AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference

Mar 23, 2026
Via arXiv

PhysQuantAgent: An Inference Pipeline of Mass Estimation for Vision-Language Models

Mar 17, 2026
Via arXiv

Autoregressive Direct Preference Optimization

Feb 10, 2026
Via arXiv

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Feb 09, 2026
Via arXiv

DISCODE: Distribution-Aware Score Decoder for Robust Automatic Evaluation of Image Captioning

Dec 16, 2025
Via arXiv

STATUS Bench: A Rigorous Benchmark for Evaluating Object State Understanding in Vision-Language Models

Oct 26, 2025
Via arXiv

AgroBench: Vision-Language Model Benchmark in Agriculture

Jul 28, 2025
Via arXiv

AnimalClue: Recognizing Animals by their Traces

Jul 27, 2025
Via arXiv

Free Random Projection for In-Context Reinforcement Learning

Apr 09, 2025
Via arXiv