Picture for Qintong Li

Qintong Li

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

Add code
May 26, 2025
Viaarxiv icon

Activation-Guided Consensus Merging for Large Language Models

Add code
May 20, 2025
Viaarxiv icon

TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning

Add code
Mar 21, 2025
Viaarxiv icon

Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration

Add code
Oct 22, 2024
Figure 1 for Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Figure 2 for Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Figure 3 for Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Figure 4 for Forewarned is Forearmed: Leveraging LLMs for Data Synthesis through Failure-Inducing Exploration
Viaarxiv icon

Privacy in LLM-based Recommendation: Recent Advances and Future Directions

Add code
Jun 03, 2024
Viaarxiv icon

GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

Add code
Feb 29, 2024
Figure 1 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 2 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 3 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Figure 4 for GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers
Viaarxiv icon

BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models

Add code
Feb 21, 2024
Figure 1 for BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Figure 2 for BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Figure 3 for BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Figure 4 for BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models
Viaarxiv icon

Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation

Add code
Oct 30, 2023
Figure 1 for Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation
Figure 2 for Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation
Figure 3 for Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation
Figure 4 for Collaborative Evaluation: Exploring the Synergy of Large Language Models and Humans for Open-ended Generation Evaluation
Viaarxiv icon

Deepfake Text Detection in the Wild

Add code
May 22, 2023
Figure 1 for Deepfake Text Detection in the Wild
Figure 2 for Deepfake Text Detection in the Wild
Figure 3 for Deepfake Text Detection in the Wild
Figure 4 for Deepfake Text Detection in the Wild
Viaarxiv icon

A Cognitive Stimulation Dialogue System with Multi-source Knowledge Fusion for Elders with Cognitive Impairment

Add code
May 14, 2023
Viaarxiv icon