Picture for Yule Liu

Yule Liu

ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities

Add code
Aug 20, 2025
Figure 1 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 2 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 3 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Figure 4 for ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Viaarxiv icon

Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks

Add code
May 28, 2025
Figure 1 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Figure 2 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Figure 3 for Evaluation Hallucination in Multi-Round Incomplete Information Lateral-Driven Reasoning Tasks
Viaarxiv icon

JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models

Add code
May 23, 2025
Figure 1 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 2 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 3 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Figure 4 for JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Viaarxiv icon

Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications

Add code
Apr 30, 2025
Figure 1 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 2 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 3 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Figure 4 for Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Viaarxiv icon

Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models

Add code
Apr 18, 2025
Viaarxiv icon

The Rising Threat to Emerging AI-Powered Search Engines

Add code
Feb 07, 2025
Figure 1 for The Rising Threat to Emerging AI-Powered Search Engines
Figure 2 for The Rising Threat to Emerging AI-Powered Search Engines
Figure 3 for The Rising Threat to Emerging AI-Powered Search Engines
Figure 4 for The Rising Threat to Emerging AI-Powered Search Engines
Viaarxiv icon

SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning

Add code
Feb 06, 2025
Figure 1 for SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
Figure 2 for SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
Figure 3 for SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
Figure 4 for SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
Viaarxiv icon

Are We in the AI-Generated Text World Already? Quantifying and Monitoring AIGT on Social Media

Add code
Dec 24, 2024
Viaarxiv icon

On the Generalization Ability of Machine-Generated Text Detectors

Add code
Dec 23, 2024
Figure 1 for On the Generalization Ability of Machine-Generated Text Detectors
Figure 2 for On the Generalization Ability of Machine-Generated Text Detectors
Figure 3 for On the Generalization Ability of Machine-Generated Text Detectors
Figure 4 for On the Generalization Ability of Machine-Generated Text Detectors
Viaarxiv icon

Quantized Delta Weight Is Safety Keeper

Add code
Nov 29, 2024
Figure 1 for Quantized Delta Weight Is Safety Keeper
Figure 2 for Quantized Delta Weight Is Safety Keeper
Figure 3 for Quantized Delta Weight Is Safety Keeper
Figure 4 for Quantized Delta Weight Is Safety Keeper
Viaarxiv icon