Picture for Osamu Yoshie

Osamu Yoshie

MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment

Add code
Jun 28, 2024
Viaarxiv icon

BreakGPT: A Large Language Model with Multi-stage Structure for Financial Breakout Detection

Add code
Feb 12, 2024
Viaarxiv icon

PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection

Add code
Nov 29, 2023
Figure 1 for PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection
Figure 2 for PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection
Figure 3 for PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection
Figure 4 for PillarNeSt: Embracing Backbone Scaling and Pretraining for Pillar-based 3D Object Detection
Viaarxiv icon

GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection

Add code
Jun 30, 2023
Figure 1 for GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection
Figure 2 for GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection
Figure 3 for GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection
Figure 4 for GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection
Viaarxiv icon

Vision Learners Meet Web Image-Text Pairs

Add code
Jan 17, 2023
Figure 1 for Vision Learners Meet Web Image-Text Pairs
Figure 2 for Vision Learners Meet Web Image-Text Pairs
Figure 3 for Vision Learners Meet Web Image-Text Pairs
Figure 4 for Vision Learners Meet Web Image-Text Pairs
Viaarxiv icon

Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective

Add code
Mar 08, 2022
Figure 1 for Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
Figure 2 for Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
Figure 3 for Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
Figure 4 for Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective
Viaarxiv icon

ZeroVL: A Strong Baseline for Aligning Vision-Language Representations with Limited Resources

Add code
Jan 18, 2022
Figure 1 for ZeroVL: A Strong Baseline for Aligning Vision-Language Representations with Limited Resources
Figure 2 for ZeroVL: A Strong Baseline for Aligning Vision-Language Representations with Limited Resources
Figure 3 for ZeroVL: A Strong Baseline for Aligning Vision-Language Representations with Limited Resources
Figure 4 for ZeroVL: A Strong Baseline for Aligning Vision-Language Representations with Limited Resources
Viaarxiv icon

PP-YOLOv2: A Practical Object Detector

Add code
Apr 21, 2021
Figure 1 for PP-YOLOv2: A Practical Object Detector
Figure 2 for PP-YOLOv2: A Practical Object Detector
Figure 3 for PP-YOLOv2: A Practical Object Detector
Figure 4 for PP-YOLOv2: A Practical Object Detector
Viaarxiv icon

OTA: Optimal Transport Assignment for Object Detection

Add code
Mar 26, 2021
Figure 1 for OTA: Optimal Transport Assignment for Object Detection
Figure 2 for OTA: Optimal Transport Assignment for Object Detection
Figure 3 for OTA: Optimal Transport Assignment for Object Detection
Figure 4 for OTA: Optimal Transport Assignment for Object Detection
Viaarxiv icon

A Reinforcement learning method for Optical Thin-Film Design

Add code
Feb 13, 2021
Figure 1 for A Reinforcement learning method for Optical Thin-Film Design
Figure 2 for A Reinforcement learning method for Optical Thin-Film Design
Figure 3 for A Reinforcement learning method for Optical Thin-Film Design
Figure 4 for A Reinforcement learning method for Optical Thin-Film Design
Viaarxiv icon