Alert button
Picture for Wei Shen

Wei Shen

Alert button

Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation

Add code
Bookmark button
Alert button
Apr 19, 2024
Chongjie Si, Xuehui Wang, Xiaokang Yang, Wei Shen

Viaarxiv icon

FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge

Add code
Bookmark button
Alert button
Apr 18, 2024
Zunran Wang, Zhonghua Li, Wei Shen, Qi Ye, Liqiang Nie

Viaarxiv icon

EndoGSLAM: Real-Time Dense Reconstruction and Tracking in Endoscopic Surgeries using Gaussian Splatting

Add code
Bookmark button
Alert button
Mar 22, 2024
Kailing Wang, Chen Yang, Yuehao Wang, Sikuang Li, Yan Wang, Qi Dou, Xiaokang Yang, Wei Shen

Viaarxiv icon

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Add code
Bookmark button
Alert button
Mar 14, 2024
Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

Figure 1 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 2 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 3 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 4 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Viaarxiv icon

Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation

Add code
Bookmark button
Alert button
Mar 08, 2024
Xiaoying Zhang, Jean-Francois Ton, Wei Shen, Hongning Wang, Yang Liu

Figure 1 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 2 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Figure 3 for Overcoming Reward Overoptimization via Adversarial Policy Optimization with Lightweight Uncertainty Estimation
Viaarxiv icon

GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Add code
Bookmark button
Alert button
Feb 20, 2024
Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

Viaarxiv icon

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 08, 2024
Zhiheng Xi, Wenxiang Chen, Boyang Hong, Senjie Jin, Rui Zheng, Wei He, Yiwen Ding, Shichun Liu, Xin Guo, Junzhe Wang, Honglin Guo, Wei Shen, Xiaoran Fan, Yuhao Zhou, Shihan Dou, Xiao Wang, Xinbo Zhang, Peng Sun, Tao Gui, Qi Zhang, Xuanjing Huang

Viaarxiv icon

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

Add code
Bookmark button
Alert button
Feb 05, 2024
Shihan Dou, Yan Liu, Haoxiang Jia, Limao Xiong, Enyu Zhou, Wei Shen, Junjie Shan, Caishuang Huang, Xiao Wang, Xiaoran Fan, Zhiheng Xi, Yuhao Zhou, Tao Ji, Rui Zheng, Qi Zhang, Xuanjing Huang, Tao Gui

Viaarxiv icon

ViTree: Single-path Neural Tree for Step-wise Interpretable Fine-grained Visual Categorization

Add code
Bookmark button
Alert button
Jan 30, 2024
Danning Lao, Qi Liu, Jiazi Bu, Junchi Yan, Wei Shen

Viaarxiv icon