Alert button
Picture for Ruiqi Zhang

Ruiqi Zhang

Alert button

Negative Preference Optimization: From Catastrophic Collapse to Effective Unlearning

Add code
Bookmark button
Alert button
Apr 08, 2024
Ruiqi Zhang, Licong Lin, Yu Bai, Song Mei

Viaarxiv icon

Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement

Add code
Bookmark button
Alert button
Feb 24, 2024
Ruiqi Zhang, Yuexiang Zhai, Andrea Zanette

Viaarxiv icon

In-Context Learning of a Linear Transformer Block: Benefits of the MLP Component and One-Step GD Initialization

Add code
Bookmark button
Alert button
Feb 22, 2024
Ruiqi Zhang, Jingfeng Wu, Peter L. Bartlett

Viaarxiv icon

AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition

Add code
Bookmark button
Alert button
Feb 18, 2024
Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao

Viaarxiv icon

Spreeze: High-Throughput Parallel Reinforcement Learning Framework

Add code
Bookmark button
Alert button
Dec 11, 2023
Jing Hou, Guang Chen, Ruiqi Zhang, Zhijun Li, Shangding Gu, Changjun Jiang

Figure 1 for Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Figure 2 for Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Figure 3 for Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Figure 4 for Spreeze: High-Throughput Parallel Reinforcement Learning Framework
Viaarxiv icon

Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface

Add code
Bookmark button
Alert button
Aug 07, 2023
Ruiqi Zhang, Jie Chen, Qiang Wang

Figure 1 for Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface
Figure 2 for Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface
Figure 3 for Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface
Figure 4 for Explicifying Neural Implicit Fields for Efficient Dynamic Human Avatar Modeling via a Neural Explicit Surface
Viaarxiv icon

Policy Finetuning in Reinforcement Learning via Design of Experiments using Offline Data

Add code
Bookmark button
Alert button
Jul 10, 2023
Ruiqi Zhang, Andrea Zanette

Viaarxiv icon

Trained Transformers Learn Linear Models In-Context

Add code
Bookmark button
Alert button
Jun 16, 2023
Ruiqi Zhang, Spencer Frei, Peter L. Bartlett

Figure 1 for Trained Transformers Learn Linear Models In-Context
Viaarxiv icon

NDF: Neural Deformable Fields for Dynamic Human Modelling

Add code
Bookmark button
Alert button
Jul 19, 2022
Ruiqi Zhang, Jie Chen

Figure 1 for NDF: Neural Deformable Fields for Dynamic Human Modelling
Figure 2 for NDF: Neural Deformable Fields for Dynamic Human Modelling
Figure 3 for NDF: Neural Deformable Fields for Dynamic Human Modelling
Figure 4 for NDF: Neural Deformable Fields for Dynamic Human Modelling
Viaarxiv icon

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory

Add code
Bookmark button
Alert button
Feb 10, 2022
Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang

Figure 1 for Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Viaarxiv icon