Picture for Ryota Yoshihashi

Ryota Yoshihashi

What-Where Transformer: A Slot-Centric Visual Backbone for Concurrent Representation and Localization

Add code
May 12, 2026
Viaarxiv icon

Teacher-Guided Routing for Sparse Vision Mixture-of-Experts

Add code
Apr 23, 2026
Viaarxiv icon

VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction

Add code
Dec 05, 2024
Figure 1 for VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
Figure 2 for VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
Figure 3 for VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
Figure 4 for VASCAR: Content-Aware Layout Generation via Visual-Aware Self-Correction
Viaarxiv icon

Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models

Add code
Nov 19, 2024
Figure 1 for Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models
Figure 2 for Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models
Figure 3 for Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models
Figure 4 for Constant Rate Schedule: Constant-Rate Distributional Change for Efficient Training and Sampling in Diffusion Models
Viaarxiv icon

Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion

Add code
Sep 04, 2023
Viaarxiv icon

Ladder Siamese Network: a Method and Insights for Multi-level Self-Supervised Learning

Add code
Nov 25, 2022
Figure 1 for Ladder Siamese Network: a Method and Insights for Multi-level Self-Supervised Learning
Figure 2 for Ladder Siamese Network: a Method and Insights for Multi-level Self-Supervised Learning
Figure 3 for Ladder Siamese Network: a Method and Insights for Multi-level Self-Supervised Learning
Figure 4 for Ladder Siamese Network: a Method and Insights for Multi-level Self-Supervised Learning
Viaarxiv icon

Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition

Add code
Jun 10, 2021
Figure 1 for Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Figure 2 for Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Figure 3 for Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Figure 4 for Context-Free TextSpotter for Real-Time and Mobile End-to-End Text Detection and Recognition
Viaarxiv icon

Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach

Add code
May 18, 2021
Figure 1 for Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach
Figure 2 for Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach
Figure 3 for Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach
Figure 4 for Finding a Needle in a Haystack: Tiny Flying Object Detection in 4K Videos using a Joint Detection-and-Tracking Approach
Viaarxiv icon

Hybrid Loss for Learning Single-Image-based HDR Reconstruction

Add code
Dec 18, 2018
Figure 1 for Hybrid Loss for Learning Single-Image-based HDR Reconstruction
Figure 2 for Hybrid Loss for Learning Single-Image-based HDR Reconstruction
Figure 3 for Hybrid Loss for Learning Single-Image-based HDR Reconstruction
Figure 4 for Hybrid Loss for Learning Single-Image-based HDR Reconstruction
Viaarxiv icon

Classification-Reconstruction Learning for Open-Set Recognition

Add code
Dec 17, 2018
Figure 1 for Classification-Reconstruction Learning for Open-Set Recognition
Figure 2 for Classification-Reconstruction Learning for Open-Set Recognition
Figure 3 for Classification-Reconstruction Learning for Open-Set Recognition
Figure 4 for Classification-Reconstruction Learning for Open-Set Recognition
Viaarxiv icon