Hirokatsu Kataoka

3D sans 3D Scans: Scalable Pre-training from Video-Generated Point Clouds

Dec 28, 2025
Via arXiv

S3OD: Towards Generalizable Salient Object Detection with Synthetic Data

Oct 24, 2025
Via arXiv

AgroBench: Vision-Language Model Benchmark in Agriculture

Jul 28, 2025
Via arXiv

AnimalClue: Recognizing Animals by their Traces

Jul 27, 2025
Via arXiv

Industrial Synthetic Segment Pre-training

May 20, 2025
Via arXiv

Industry-focused Synthetic Segmentation Pre-training

May 19, 2025
Via arXiv

Simple Visual Artifact Detection in Sora-Generated Videos

Apr 30, 2025
Via arXiv

Formula-Supervised Sound Event Detection: Pre-Training Without Real Data

Apr 06, 2025
Via arXiv

Pre-training with 3D Synthetic Data: Learning 3D Point Cloud Instance Segmentation from 3D Synthetic Scenes

Mar 31, 2025
Via arXiv

Leveraging LLMs with Iterative Loop Structure for Enhanced Social Intelligence in Video Question Answering

Mar 27, 2025
Via arXiv