Alert button
Picture for Masashi Sugiyama

Masashi Sugiyama

Alert button

Leveraging Domain-Unlabeled Data in Offline Reinforcement Learning across Two Domains

Add code
Bookmark button
Alert button
Apr 11, 2024
Soichiro Nishimori, Xin-Qiang Cai, Johannes Ackermann, Masashi Sugiyama

Viaarxiv icon

Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training

Add code
Bookmark button
Alert button
Apr 09, 2024
Ming-Kun Xie, Jia-Hao Xiao, Pei Peng, Gang Niu, Masashi Sugiyama, Sheng-Jun Huang

Viaarxiv icon

Reinforcement Learning with Options and State Representation

Add code
Bookmark button
Alert button
Mar 25, 2024
Ayoub Ghriss, Masashi Sugiyama, Alessandro Lazaric

Figure 1 for Reinforcement Learning with Options and State Representation
Figure 2 for Reinforcement Learning with Options and State Representation
Figure 3 for Reinforcement Learning with Options and State Representation
Figure 4 for Reinforcement Learning with Options and State Representation
Viaarxiv icon

Reinforcement Learning with Options

Add code
Bookmark button
Alert button
Mar 16, 2024
Ayoub Ghriss, Masashi Sugiyama, Alessandro Lazaric

Figure 1 for Reinforcement Learning with Options
Figure 2 for Reinforcement Learning with Options
Figure 3 for Reinforcement Learning with Options
Figure 4 for Reinforcement Learning with Options
Viaarxiv icon

Learning with Noisy Foundation Models

Add code
Bookmark button
Alert button
Mar 11, 2024
Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj

Viaarxiv icon

VEC-SBM: Optimal Community Detection with Vectorial Edges Covariates

Add code
Bookmark button
Alert button
Feb 29, 2024
Guillaume Braun, Masashi Sugiyama

Viaarxiv icon

Generating Chain-of-Thoughts with a Direct Pairwise-Comparison Approach to Searching for the Most Promising Intermediate Thought

Add code
Bookmark button
Alert button
Feb 10, 2024
Zhen-Yu Zhang, Siwei Han, Huaxiu Yao, Gang Niu, Masashi Sugiyama

Viaarxiv icon

Reinforcement Learning from Bagged Reward: A Transformer-based Approach for Instance-Level Reward Redistribution

Add code
Bookmark button
Alert button
Feb 06, 2024
Yuting Tang, Xin-Qiang Cai, Yao-Xiang Ding, Qiyu Wu, Guoqing Liu, Masashi Sugiyama

Viaarxiv icon

A General Framework for Learning from Weak Supervision

Add code
Bookmark button
Alert button
Feb 02, 2024
Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj

Viaarxiv icon

Direct Distillation between Different Domains

Add code
Bookmark button
Alert button
Jan 12, 2024
Jialiang Tang, Shuo Chen, Gang Niu, Hongyuan Zhu, Joey Tianyi Zhou, Chen Gong, Masashi Sugiyama

Viaarxiv icon