Part Benchmark


Sim2Real in endoscopy segmentation with a novel structure aware image translation

May 05, 2025

LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis

May 05, 2025

AI agents may be worth the hype but not the resources (yet): An initial exploration of machine translation quality and costs in three language pairs in the legal and news domains

May 02, 2025

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks

Apr 30, 2025

AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images

Apr 30, 2025

ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes

Apr 29, 2025

ReasonIR: Training Retrievers for Reasoning Tasks

Apr 29, 2025

ResearchCodeAgent: An LLM Multi-Agent System for Automated Codification of Research Methodologies

Apr 28, 2025

3DPyranet Features Fusion for Spatio-temporal Feature Learning

Apr 26, 2025

Sliced-Wasserstein Distance-based Data Selection

Apr 17, 2025