Jingkai Liu

AInsteinBench: Benchmarking Coding Agents on Scientific Repositories

Dec 24, 2025
Via arXiv

LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics

Dec 24, 2025
Via arXiv

LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation

Nov 09, 2025
Via arXiv

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Sep 04, 2025
Via arXiv

Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning

Jun 10, 2024
Via arXiv