Tong Zhang

Nanjing University of Science and Technology, Nanjing, China

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Feb 13, 2025

Crime Forecasting: A Spatio-temporal Analysis with Deep Learning Models

Feb 11, 2025

Intelligent Legal Assistant: An Interactive Clarification System for Legal Question Answering

Feb 11, 2025

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Feb 11, 2025

Demystifying Singular Defects in Large Language Models

Feb 10, 2025

Nearly Optimal Sample Complexity of Offline KL-Regularized Contextual Bandits under Single-Policy Concentrability

Feb 09, 2025

Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training

Feb 05, 2025

Catoni Contextual Bandits are Robust to Heavy-tailed Rewards

Feb 04, 2025

Online-BLS: An Accurate and Efficient Online Broad Learning System for Data Stream Classification

Jan 28, 2025

Divergence-Augmented Policy Optimization

Jan 25, 2025