Picture for Tatsuya Harada

Tatsuya Harada

Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

Add code
Jan 18, 2024
Figure 1 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 2 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 3 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Figure 4 for Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation
Viaarxiv icon

Aleth-NeRF: Illumination Adaptive NeRF with Concealing Field Assumption

Add code
Dec 15, 2023
Viaarxiv icon

Can Physician Judgment Enhance Model Trustworthiness? A Case Study on Predicting Pathological Lymph Nodes in Rectal Cancer

Add code
Dec 15, 2023
Viaarxiv icon

Combining inherent knowledge of vision-language models with unsupervised domain adaptation through self-knowledge distillation

Add code
Dec 11, 2023
Figure 1 for Combining inherent knowledge of vision-language models with unsupervised domain adaptation through self-knowledge distillation
Figure 2 for Combining inherent knowledge of vision-language models with unsupervised domain adaptation through self-knowledge distillation
Figure 3 for Combining inherent knowledge of vision-language models with unsupervised domain adaptation through self-knowledge distillation
Figure 4 for Combining inherent knowledge of vision-language models with unsupervised domain adaptation through self-knowledge distillation
Viaarxiv icon

Fully Spiking Denoising Diffusion Implicit Models

Add code
Dec 04, 2023
Viaarxiv icon

Gradual Source Domain Expansion for Unsupervised Domain Adaptation

Add code
Nov 16, 2023
Viaarxiv icon

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Add code
Oct 17, 2023
Figure 1 for Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Figure 2 for Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Figure 3 for Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Figure 4 for Open X-Embodiment: Robotic Learning Datasets and RT-X Models
Viaarxiv icon

Expert Uncertainty and Severity Aware Chest X-Ray Classification by Multi-Relationship Graph Learning

Add code
Sep 06, 2023
Viaarxiv icon

The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track

Add code
Aug 14, 2023
Figure 1 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
Figure 2 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
Figure 3 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
Figure 4 for The Sound Demixing Challenge 2023 $\unicode{x2013}$ Music Demixing Track
Viaarxiv icon

Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering

Add code
Jul 22, 2023
Figure 1 for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
Figure 2 for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
Figure 3 for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
Figure 4 for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
Viaarxiv icon