Picture for Masahiro Kaneko

Masahiro Kaneko

A Japanese Benchmark for Evaluating Social Bias in Reasoning Based on Attribution Theory

Add code
Apr 01, 2026
Viaarxiv icon

JUBAKU: An Adversarial Benchmark for Exposing Culturally Grounded Stereotypes in Japanese LLMs

Add code
Mar 21, 2026
Viaarxiv icon

Beyond the Resumé: A Rubric-Aware Automatic Interview System for Information Elicitation

Add code
Mar 02, 2026
Viaarxiv icon

JailNewsBench: Multi-Lingual and Regional Benchmark for Fake News Generation under Jailbreak Attacks

Add code
Mar 01, 2026
Viaarxiv icon

Autoregressive Direct Preference Optimization

Add code
Feb 10, 2026
Viaarxiv icon

Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

Add code
Feb 06, 2026
Viaarxiv icon

Paraphrasing Adversarial Attack on LLM-as-a-Reviewer

Add code
Jan 11, 2026
Viaarxiv icon

Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Add code
Mar 08, 2025
Viaarxiv icon

Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning

Add code
Feb 28, 2025
Figure 1 for Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
Figure 2 for Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
Figure 3 for Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
Figure 4 for Rectifying Belief Space via Unlearning to Harness LLMs' Reasoning
Viaarxiv icon

Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models

Add code
Feb 17, 2025
Figure 1 for Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Figure 2 for Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Figure 3 for Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Figure 4 for Balanced Multi-Factor In-Context Learning for Multilingual Large Language Models
Viaarxiv icon