Rio Yokota

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Jul 04, 2024
Via arXiv

Building a Large Japanese Web Corpus for Large Language Models

Apr 27, 2024
Via arXiv

Continual Pre-Training for Cross-Lingual LLM Adaptation: Enhancing Japanese Language Capabilities

Apr 27, 2024
Via arXiv

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Mar 30, 2024
Via arXiv

Variational Learning is Effective for Large Deep Networks

Feb 27, 2024
Via arXiv

SegRCDB: Semantic Segmentation via Formula-Driven Supervised Learning

Sep 29, 2023
Via arXiv

Pre-training Vision Transformers with Very Limited Synthesized Images

Jul 31, 2023
Via arXiv

ASDL: A Unified Interface for Gradient Preconditioning in PyTorch

May 08, 2023
Via arXiv

Visual Atoms: Pre-training Vision Transformers with Sinusoidal Waves

Mar 02, 2023
Via arXiv

Empirical Study on Optimizer Selection for Out-of-Distribution Generalization

Nov 18, 2022
Via arXiv