Picture for Masashi Sugiyama

Masashi Sugiyama

Tokyo Institute of Technology

Shaping Schema via Language Representation as the Next Frontier for LLM Intelligence Expanding

Add code
May 10, 2026
Viaarxiv icon

Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning

Add code
May 07, 2026
Viaarxiv icon

Data-dependent Exploration for Online Reinforcement Learning from Human Feedback

Add code
May 06, 2026
Viaarxiv icon

Riemannian Langevin Dynamics: Strong Convergence of Geometric Euler-Maruyama Scheme

Add code
Mar 04, 2026
Viaarxiv icon

Are Multimodal Large Language Models Good Annotators for Image Tagging?

Add code
Feb 24, 2026
Viaarxiv icon

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

Add code
Feb 13, 2026
Viaarxiv icon

BrokenBind: Universal Modality Exploration beyond Dataset Boundaries

Add code
Feb 06, 2026
Viaarxiv icon

Bifrost: Steering Strategic Trajectories to Bridge Contextual Gaps for Self-Improving Agents

Add code
Feb 05, 2026
Viaarxiv icon

Causal Graph Learning via Distributional Invariance of Cause-Effect Relationship

Add code
Feb 03, 2026
Viaarxiv icon

Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models

Add code
Jan 28, 2026
Viaarxiv icon