Picture for Yuxuan Zhu

Yuxuan Zhu

UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench

Add code
Jun 10, 2025
Viaarxiv icon

Embed Progressive Implicit Preference in Unified Space for Deep Collaborative Filtering

Add code
May 28, 2025
Viaarxiv icon

ELT-Bench: An End-to-End Benchmark for Evaluating AI Agents on ELT Pipelines

Add code
Apr 07, 2025
Viaarxiv icon

SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching

Add code
Apr 01, 2025
Viaarxiv icon

CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities

Add code
Mar 21, 2025
Viaarxiv icon

Will Large Language Models be a Panacea to Autonomous Driving?

Add code
Sep 24, 2024
Figure 1 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 2 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 3 for Will Large Language Models be a Panacea to Autonomous Driving?
Figure 4 for Will Large Language Models be a Panacea to Autonomous Driving?
Viaarxiv icon

On the Robustness of Graph Reduction Against GNN Backdoor

Add code
Jul 02, 2024
Viaarxiv icon

FedTrans: Efficient Federated Learning via Multi-Model Transformation

Add code
Apr 25, 2024
Viaarxiv icon

Feature Attribution with Necessity and Sufficiency via Dual-stage Perturbation Test for Causal Explanation

Add code
Feb 13, 2024
Viaarxiv icon

Where and How to Attack? A Causality-Inspired Recipe for Generating Counterfactual Adversarial Examples

Add code
Dec 21, 2023
Viaarxiv icon