Topic


AI Scientist via Synthetic Task Scaling

Mar 17, 2026

WildDepth: A Multimodal Dataset for 3D Wildlife Perception and Depth Estimation

Mar 17, 2026

CritiSense: Critical Digital Literacy and Resilience Against Misinformation

Mar 17, 2026

SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding

Mar 17, 2026

A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog

Mar 17, 2026

MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning

Mar 17, 2026

CLAG: Adaptive Memory Organization via Agent-Driven Clustering for Small Language Model Agents

Mar 16, 2026

Information Asymmetry across Language Varieties: A Case Study on Cantonese-Mandarin and Bavarian-German QA

Mar 16, 2026

Seamless Deception: Larger Language Models Are Better Knowledge Concealers

Mar 15, 2026

MedArena: Comparing LLMs for Medicine-in-the-Wild Clinician Preferences

Mar 13, 2026