Picture for Jiantao Qiu

Jiantao Qiu

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Add code
Jun 08, 2025
Viaarxiv icon

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification

Add code
Jun 08, 2025
Viaarxiv icon

Not All Documents Are What You Need for Extracting Instruction Tuning Data

Add code
May 18, 2025
Viaarxiv icon

Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models

Add code
Apr 19, 2025
Viaarxiv icon

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages

Add code
Jan 24, 2025
Figure 1 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 2 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 3 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 4 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Viaarxiv icon

Harnessing Diversity for Important Data Selection in Pretraining Large Language Models

Add code
Sep 25, 2024
Viaarxiv icon

InternLM2 Technical Report

Add code
Mar 26, 2024
Figure 1 for InternLM2 Technical Report
Figure 2 for InternLM2 Technical Report
Figure 3 for InternLM2 Technical Report
Figure 4 for InternLM2 Technical Report
Viaarxiv icon

WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset

Add code
Mar 12, 2024
Figure 1 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 2 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 3 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Figure 4 for WanJuan-CC: A Safe and High-Quality Open-sourced English Webtext Dataset
Viaarxiv icon

WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models

Add code
Aug 22, 2023
Viaarxiv icon

Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration

Add code
Feb 24, 2022
Figure 1 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration
Figure 2 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration
Figure 3 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration
Figure 4 for Explore-Bench: Data Sets, Metrics and Evaluations for Frontier-based and Deep-reinforcement-learning-based Autonomous Exploration
Viaarxiv icon