Picture for Masao Utiyama

Masao Utiyama

CADEL: A Corpus of Administrative Web Documents for Japanese Entity Linking

Add code
Mar 31, 2026
Viaarxiv icon

OptiMer: Optimal Distribution Vector Merging Is Better than Data Mixing for Continual Pre-Training

Add code
Mar 30, 2026
Viaarxiv icon

Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation

Add code
Dec 19, 2025
Figure 1 for Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation
Figure 2 for Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation
Figure 3 for Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation
Figure 4 for Minimum Bayes Risk Decoding for Error Span Detection in Reference-Free Automatic Machine Translation Evaluation
Viaarxiv icon

PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation

Add code
Dec 15, 2025
Viaarxiv icon

Comprehensive Evaluation on Lexical Normalization: Boundary-Aware Approaches for Unsegmented Languages

Add code
May 28, 2025
Viaarxiv icon

TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Add code
Mar 14, 2025
Figure 1 for TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Figure 2 for TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Figure 3 for TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Figure 4 for TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Viaarxiv icon

IteRABRe: Iterative Recovery-Aided Block Reduction

Add code
Mar 08, 2025
Figure 1 for IteRABRe: Iterative Recovery-Aided Block Reduction
Figure 2 for IteRABRe: Iterative Recovery-Aided Block Reduction
Figure 3 for IteRABRe: Iterative Recovery-Aided Block Reduction
Figure 4 for IteRABRe: Iterative Recovery-Aided Block Reduction
Viaarxiv icon

Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation

Add code
Jan 06, 2025
Figure 1 for Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Figure 2 for Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Figure 3 for Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Figure 4 for Registering Source Tokens to Target Language Spaces in Multilingual Neural Machine Translation
Viaarxiv icon

Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation

Add code
Dec 03, 2024
Figure 1 for Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
Figure 2 for Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
Figure 3 for Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
Figure 4 for Improving Language Transfer Capability of Decoder-only Architecture in Multilingual Neural Machine Translation
Viaarxiv icon

On Eliciting Syntax from Language Models via Hashing

Add code
Oct 05, 2024
Viaarxiv icon