Picture for Tong Zhang

Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Add code
Feb 15, 2024
Viaarxiv icon

A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference

Add code
Feb 11, 2024
Figure 1 for A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference
Figure 2 for A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference
Figure 3 for A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference
Figure 4 for A Theoretical Analysis of Nash Learning from Human Feedback under General KL-Regularized Preference
Viaarxiv icon

The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs

Add code
Feb 06, 2024
Figure 1 for The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs
Figure 2 for The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs
Figure 3 for The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs
Figure 4 for The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs
Viaarxiv icon

EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain

Add code
Feb 05, 2024
Figure 1 for EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Figure 2 for EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Figure 3 for EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Figure 4 for EarthGPT: A Universal Multi-modal Large Language Model for Multi-sensor Image Comprehension in Remote Sensing Domain
Viaarxiv icon

PipeNet: Question Answering with Semantic Pruning over Knowledge Graphs

Add code
Jan 31, 2024
Viaarxiv icon

The Surprising Harmfulness of Benign Overfitting for Adversarial Robustness

Add code
Jan 25, 2024
Figure 1 for The Surprising Harmfulness of Benign Overfitting for Adversarial Robustness
Viaarxiv icon

General Flow as Foundation Affordance for Scalable Robot Learning

Add code
Jan 21, 2024
Figure 1 for General Flow as Foundation Affordance for Scalable Robot Learning
Figure 2 for General Flow as Foundation Affordance for Scalable Robot Learning
Figure 3 for General Flow as Foundation Affordance for Scalable Robot Learning
Figure 4 for General Flow as Foundation Affordance for Scalable Robot Learning
Viaarxiv icon

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance

Add code
Jan 17, 2024
Figure 1 for MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Figure 2 for MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Figure 3 for MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Figure 4 for MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
Viaarxiv icon

Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo

Add code
Jan 12, 2024
Figure 1 for Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
Figure 2 for Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
Figure 3 for Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
Figure 4 for Faster Sampling without Isoperimetry via Diffusion-based Monte Carlo
Viaarxiv icon

AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets

Add code
Jan 05, 2024
Viaarxiv icon