Picture for Jielin Qiu

Jielin Qiu

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

Add code
Nov 17, 2025
Figure 1 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 2 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 3 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 4 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Viaarxiv icon

GeoGNN: Quantifying and Mitigating Semantic Drift in Text-Attributed Graphs

Add code
Nov 12, 2025
Viaarxiv icon

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

Add code
Sep 11, 2025
Figure 1 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 2 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 3 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 4 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Viaarxiv icon

MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning

Add code
May 30, 2025
Figure 1 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 2 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 3 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Figure 4 for MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning
Viaarxiv icon

Evaluating Durability: Benchmark Insights into Multimodal Watermarking

Add code
Jun 06, 2024
Figure 1 for Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Figure 2 for Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Figure 3 for Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Figure 4 for Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Viaarxiv icon

Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition

Add code
Mar 19, 2024
Figure 1 for Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Figure 2 for Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Figure 3 for Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Figure 4 for Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition
Viaarxiv icon

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Add code
Mar 07, 2024
Figure 1 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 2 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 3 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 4 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Viaarxiv icon

Offline Reinforcement Learning with Imbalanced Datasets

Add code
Jul 29, 2023
Viaarxiv icon

Embodied Executable Policy Learning with Language-based Scene Summarization

Add code
Jun 09, 2023
Viaarxiv icon

MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos

Add code
Jun 07, 2023
Figure 1 for MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Figure 2 for MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Figure 3 for MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Figure 4 for MultiSum: A Dataset for Multimodal Summarization and Thumbnail Generation of Videos
Viaarxiv icon