Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation

May 17, 2024
Via arXiv

RLHF Workflow: From Reward Modeling to Online RLHF

May 13, 2024
Via arXiv

An Efficient Algorithm for Sum-Rate Maximization in Fluid Antenna-Assisted ISAC System

May 10, 2024
Via arXiv

SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer

Apr 23, 2024
Via arXiv

Incremental Self-training for Semi-supervised Learning

Apr 14, 2024
Via arXiv

Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange

Apr 11, 2024
Via arXiv

Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm

Apr 04, 2024
Via arXiv

Towards Better Generalization in Open-Domain Question Answering by Mitigating Context Memorization

Apr 02, 2024
Via arXiv

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Mar 28, 2024
Via arXiv

On the Benefits of Over-parameterization for Out-of-Distribution Generalization

Mar 26, 2024
Via arXiv