Picture for Shangqing Tu

Shangqing Tu

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Add code
Jun 04, 2025
Viaarxiv icon

MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos

Add code
Jun 04, 2025
Viaarxiv icon

LecEval: An Automated Metric for Multimodal Knowledge Acquisition in Multimedia Learning

Add code
May 04, 2025
Viaarxiv icon

Shifting Long-Context LLMs Research from Input to Output

Add code
Mar 07, 2025
Viaarxiv icon

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Add code
Feb 20, 2025
Viaarxiv icon

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Add code
Dec 19, 2024
Figure 1 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 2 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 3 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Figure 4 for LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks
Viaarxiv icon

Awaking the Slides: A Tuning-free and Knowledge-regulated AI Tutoring System via Language Model Coordination

Add code
Sep 11, 2024
Viaarxiv icon

From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents

Add code
Sep 05, 2024
Figure 1 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 2 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 3 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Figure 4 for From MOOC to MAIC: Reshaping Online Teaching and Learning through LLM-driven Agents
Viaarxiv icon

R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models

Add code
Jun 17, 2024
Viaarxiv icon

Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack

Add code
Jun 17, 2024
Figure 1 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 2 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 3 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Figure 4 for Knowledge-to-Jailbreak: One Knowledge Point Worth One Attack
Viaarxiv icon