Part Benchmark


Aligning Forest and Trees in Images and Long Captions for Visually Grounded Understanding

Add code
Feb 03, 2026
Viaarxiv icon

FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation

Add code
Feb 03, 2026
Viaarxiv icon

ProxyImg: Towards Highly-Controllable Image Representation via Hierarchical Disentangled Proxy Embedding

Add code
Feb 02, 2026
Viaarxiv icon

From Videos to Conversations: Egocentric Instructions for Task Assistance

Add code
Feb 01, 2026
Viaarxiv icon

Segment Any Events with Language

Add code
Jan 30, 2026
Viaarxiv icon

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation

Add code
Jan 31, 2026
Viaarxiv icon

MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs

Add code
Jan 29, 2026
Viaarxiv icon

Disentangling multispecific antibody function with graph neural networks

Add code
Jan 30, 2026
Viaarxiv icon

Shape of Thought: Progressive Object Assembly via Visual Chain-of-Thought

Add code
Jan 28, 2026
Viaarxiv icon

Distillation-based Layer Dropping (DLD): Effective End-to-end Framework for Dynamic Speech Networks

Add code
Jan 27, 2026
Viaarxiv icon